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Pillars of reform 


The Chinese government’s planned overhaul of its core research-funding system is vital if the 
country is to achieve its potential on the global scientific stage. 


uses to invest in science and technology is broken. The nation’s 

past efforts to become a great innovative state have produced 
clear signs of success: a flourishing space programme, a dominant 
global position in genome sequencing and some internationally 
prominent technology companies. But scratching beneath that sur- 
face reveals a creeping suspicion that China is not getting value for the 
money it diverts into research. 

These misgivings seem to have reached crisis point. China today 
is full of new initiatives, reforms and an anti-corruption drive that 
together aim to set the nation on the right track. The impact on science 
is set to be monumental — if China follows it through. 

The problems that lead to inefficiency in science are various. 
Research programmes overlap. Low-quality researchers, often selected 
more than a decade ago when they were the best that China could get, 
are now a dead weight in a system that has since managed to acquire 
much better. And worryingly, earlier this month, the government 
accused researchers at some of the country’s most prestigious uni- 
versities of misusing research funds (see Nature http://doi.org/wpb; 
2014). Shutting down such egregious abusers is a first step, and the 
ten-year prison terms handed down to two scientists who diverted 
funds certainly indicate that China is determined to make others think 
twice before they forge receipts. 

The nation is also right to reform how the Chinese Academy of Sci- 
ences supports promising research projects. And perhaps most boldly 
— although the details are still sketchy — the finance and science 
ministries last week announced a joint effort to completely overhaul 
the way competitive funding is distributed. 

In principle, this could drive improvements in the crucial area of 
how research and researchers are assessed. Critics have long argued 
that small grants in China are reviewed strictly, whereas large grants 
receive little or no review. The large grants, critics charge, are too often 
decided on the basis of personal connections, not scientific validity. 
The latest reform aims to stop the science ministry awarding these 
large grants and will instead distribute them through a new (albeit 
still vaguely defined) ‘five-pillar systenv. 

The key to China's new system is how it will judge which research 
is worth pursuing. How can it promote the truly innovative and feasi- 
ble while suppressing science that is either not original or unfeasible? 
This is not a new problem. Policy-makers and leaders of the science 
community worldwide grapple with these questions. The topic was 
again addressed at a meeting on research assessment and evaluation 
in Shanghai last week, co-sponsored by Nature. 

Discussions there ranged from the evaluation of young research- 
ers to the difficulty of balancing societal impact with research that 
drives innovation. For instance, Nobel-prizewinning biophysicist 
Kurt Wiithrich argued for focusing less on past achievements and 
failures, and more on “talented underachievers” — those with great 


Cs has come to an unsettling conclusion: the system that it 


ideas who just might not have proved themselves yet. 

Assessment of research is an especially urgent issue for China as it 
attempts to leave behind cronyism and an incestuous grant system. 
It would be too easy to fall back on hyper- quantitative assessment 
—a system that results in scientists running from one project to the 
next, trying to join as many as they can to maximize the number of 
papers and awards that they rack up. 

The reform of the Chinese Academy of Sciences is meant to change 
that. An initiative in translational research, launched earlier this 

month (see page 547), has put recruitment of 


“The impact on the best researchers at the centre of its plan- 
science is set to ning — and has arranged an impressive inter- 
be monumental national committee to ensure that happens. 
— if China Will the reform of China’s core competitive 
follows it funding system work? It is a promising sign 
through. ” that the National Natural Science Founda- 


tion of China, the well-respected body that 
distributes the smaller grants, will be one of the five pillars now tasked 
with broader research assessment. 

But what about the other pillars? Will they be the old fragmented 
divisions of the science ministry with new names, or will they really be 
streamlined units that are determined to make every renminbi count? 
How the government acts on that point will largely determine the 
future of research in China. 

In doing so, it will influence the rate at which China’s overseas 
scientists are willing to head back home, and whether foreign scientists 
will respond to China's attempts to lure them. Such reforms could also 
help to resolve one of the big questions in modern science: ifand when 
the impact of research in China will surpass that of the United States. m 


Call to action 


Time to ramp up science’s contribution to 
controlling the Ebola outbreak. 


continued to spread. Research has deferred to the need to gear 

up the public-health response. But there is a growing sense that, 
unless science can somehow now change the game, the outbreak will 
be difficult to bring under control. 

The Ebola virus has killed more than 4,800 people in six countries, 
and has affected people in another two, the latest being Mali. There 
are still not enough medical staff and treatment beds to handle the 
current caseload. The World Health Organization (WHO) projects that 


S cience has so far taken a back seat as the Ebola outbreak has 
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as many as 10,000 new cases could be arising per week by December 
if the outbreak is not turned around. 

Enter science. Speeding the development of treatments and vaccines 
is one area in which the international community is trying to move 
forward. On 22 October, the US Biomedical Advanced Research and 
Development Authority and the US Army awarded US$17.1 million 
to Profectus BioSciences, a company based in Baltimore, Maryland, 
that is developing vaccines against Ebola based on vesicular stomatitis 
virus. It is the third candidate Ebola vaccine to have moved towards or 
into clinical trials this year. On 24 October, the WHO outlined plans 
to test the first two — one licensed to NewLink Genetics of Ames, 
Iowa, the other being developed by GlaxoSmithKline, headquartered 
in London. These two vaccines have already entered human safety tri- 
als and the WHO says that they could be tested in health-care workers 
and others in West Africa as early as December. 

In the meantime, aid agencies such as Médecins Sans Frontiéres 
(also known as Doctors Without Borders) and researchers funded by 
the European Union will test candidate Ebola treatments, including 
experimental drugs, medicines already approved for other uses that 
could be made available ‘off label’ and purified plasma or blood from 
Ebola survivors. 

Beyond treatments and vaccines, scientists have more fundamental 
questions, about both the Ebola virus behind the current outbreak 
and other viruses in the family to which it belongs, the filoviruses. 
This group includes Marburg virus, also capable of causing a lethal 
haemorrhagic fever, which killed a Ugandan health-care worker 
on 28 September. A third filovirus outbreak occurred this year in 
the Democratic Republic of the Congo, where an Ebola outbreak 
unrelated to that in West Africa has killed 49 people. 

The emergence of three filovirus outbreaks this year and the 
increasing frequency and reach of such outbreaks — which have 
occurred every year except 2 in the past 21 years — should serve 
as the clearest warning possible: we urgently need to understand 
more about the pathology, distribution, epidemiology and clini- 
cal aspects of these viruses. A World View on page 537 argues that 
such science should help to steer the response; a News Feature on 
page 554 lays out the five most pressing questions about the filo- 
viruses, and says why answering them might help to prevent a future 
outbreak or even help to bring this one under control. 


For instance, new filoviruses have been discovered within the 
past five years, such as the Lloviu virus discovered in 2011 in bats 
in Spain. And scientists have learnt that these viruses have a much 
more widespread distribution than was suspected. The Reston virus, 
for example, an ebolavirus that does not seem to harm humans, 
has turned up in recent years in pigs in both the Philippines and China. 
Scientists suspect that there are more of these viruses to be found, in 
more places, and urgently want to understand why some are lethal to 

humans and others are not — and whether 


“Whether it that could change. 

is Ebola virus It is also not known which animals 
or something harbour Ebola virus in the wild, or how the 
completely first person infected in the West African 
different, there outbreak last December contracted the 
will be anext disease. Understanding this is crucial if 
time.” people are to avoid a possible reservoir in 


the future. 

It has been difficult to answer these questions for many reasons, 
such as the (fortunate) relative rarity and unpredictability of human 
filovirus outbreaks. And laboratory studies require highly contained, 
specialized biosafety-level-4 (BSL-4) labs — of which there are too 
few around the world. 

Thanks to a biodefence building boom over the past decade, there 
are now 13 such labs planned or operating in the United States. 
Canada, France, Australia, Germany, the United Kingdom, South 
Africa, Gabon and Russia are among the select nations that also have 
such facilities. But there are major research-funding nations, such as 
Japan, that do not have BSL-4 labs, or do not allow them to perform 
the highest-containment research because of worries that pathogens 
could escape and spark lethal local epidemics. 

The current Ebola outbreak proves the fallacy of that decision. The 
world would not be in the position it is today, with the possibility of 
deploying an Ebola vaccine during the current outbreak, without the 
existence of both high-containment facilities and money for research 
on diseases that are, thankfully, rare in developing countries. More of 
both, in more places, can only hasten our understanding of Ebola and 
other diseases. Because one thing is clear: whether it is Ebola virus, 
another filovirus or something completely different, there will be a 
next time. m 


Code share 


Papers in Nature journals should make 
computer code accessible where possible. 


reproducibility of our research papers is that key components 
of publications should be available to peers who wish to vali- 
date the techniques and results. 

A core element of many papers is the computer code used by authors 
in models, simulations and data analysis. In an ideal world, this code 
would always be transportable and easily used by others. In sucha world, 
our editorial policy would be to insist on sharing to allow free use, as 
we already do (as far as is practicable) with data and research materials. 
Unfortunately, such an ideal is not easy to attain owing to the amount of 
extra funding and effort it would require to render some major pieces 
of code shareable. Nevertheless, we at Nature and the Nature research 
journals want to encourage as much sharing as possible. 

Climate modellers have made some strides in this regard. The journal 
Geoscientific Model Development has a good example of such a policy 
(see go.nature.com/jv8glw), and an article in Nature Geoscience dis- 
cusses some of the opportunities presented by code sharing, as well as 


A theme in Nature’s ongoing campaign for the replicability and 
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the obstacles (S. M. Easterbrook Nature Geosci. 7,779-781; 2014). 

Asa leading example of transparency policies in other disciplines, 
the data journal GigaScience requires code used in its papers to be 
available, and hosts it in a way that allows others to analyse the data in 
publications. One point made by Easterbrook is that even ifthe code 
is shared, others might often make little or no use of it, but on some 
occasions the take-up will be large. 

Nature and the Nature journals have decided that, given the diversity 
of practices in the disciplines we cover, we cannot insist on sharing com- 
puter code in all cases. But we can go further than we have in the past, by 
at least indicating when code is available. Accordingly, our policy now 
mandates that when code is central to reaching a paper's conclusions, we 
require a statement describing whether that code is available and setting 
out any restrictions on accessibility. Editors will insist on availability 
where they consider it appropriate: any practical issues preventing code 
sharing will be evaluated by the editors, who reserve the right to decline 
a paper if important code is unavailable. Moreover, we will provide a 
dedicated section in articles in which any information on computer 
code can be placed. And we will work with individual communities to 
put together best-practice guidelines and possibly more-detailed rules. 

For full details, see our guide for authors at 
go.nature.com/o5ykhe. For an archive of our 
content and initiatives concerning reproduc- 
ibility, see http://www.nature.com/nature/focus/ 
reproducibility. m 
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superstition. We see this in Africa: the murder last month of 

eight people working to raise awareness of the disease close to 
the town of Nzerekore, in southeastern Guinea, is a tragic example. 
But we see it in the supposed scientific superpowers too. 

In the United States, for example, the debate over how the nation 
should respond is being undermined by media hype and political panic 
over domestic cases; there are calls for total bans on flights from the out- 
break region. And here in Australia, the government is reluctant to send 
people to help to control the spread of the disease until it can get guar- 
antees from nations closer to the outbreak that those people would be 
able to receive treatment in the event that they became infected. Worse, 
health-care workers who voluntarily went to the affected region to help 
have been criticized on their return to Australia. 
These people — including a nurse who developed 
a fever after her return (but tested negative for 
Ebola) — have been turned on by politicians and 
the press for putting fellow Australians at risk, 
despite having followed expert guidance and 
quarantined themselves. 

The threat posed by a few imported cases 
of Ebola is low: Australia has the procedures, 
resources and facilities here to minimize the risk 
of secondary transmission. And other developed 
countries are willing to accept that risk because 
they understand the crucial and urgent need to 
tackle the outbreak in Africa. Mathematical mod- 
els predict that transmission will continue for 
many months, even if the rate of expansion levels 
out soon. The large number of infections clearly 
increases the probability of exports and that the 
viruss genome will continue to mutate. The real issue is that the threat 
to Australia, the United States and other developed countries will be 
much higher in six months. The best defence is to act now and in Africa. 

I believe that Australia could handle isolated cases. Why do politi- 
cians and the public not seem to share that confidence? It is true that 
we do not understand the Ebola virus very well (see page 554), and 
when science is lacking, the default response to infectious disease is 
often fear. But before we are too critical, scientists must realize that 
some of the public and political response is driven by the mystique 
that we have created around Ebola and other viral haemorrhagic 
fever agents. 

For example, when the outbreak began, several biomedical research 
centres in Australia promptly established procedures for screening 
possible cases — all negative so far. Eager to reas- 


Ts Ebola outbreak has pitted rational science against fear and 
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Developed nations must 
not fear sending Ebola help 


The anxiety and stigma associated with Ebola are hampering Australia’s 
willingness and ability to help with the control efforts in Africa, argues Tim Inglis. 


images of staff entering secure facilities in full Tyvek protection suits 
was anything but. Such footage does little to challenge a popular view 
informed as much by disaster movies as by molecular biology. In fact, 
it could have inadvertently made the situation worse and entrenched 
a national preoccupation with border protection. 

Research on Ebola lags behind the clinical need, but much has been 
learned about its virology, immunology and molecular epidemiology 
in the past nine months. Here, science can suggest a way to improve 
the international response. At present, lab resources — including those 
sent from abroad — tend to be located in disease hot spots. But if we 
want to restrict the outward spread of the virus, it makes little sense to 
busy these screening facilities — and new ones to be deployed — with 
samples that have a high probability of testing positive. We should focus 
support on rapid turnaround of tests for the field 
epidemiology teams working at the edge of the 
outbreak zone. 

This would take diagnosis to where patients 
are most likely to become infected next, rather 
than sending those people or their blood samples 
to centres where the disease is concentrated. This 
could help to steer the countermeasure response 
to where it is likely to have the most impact. In 
view of the escalating case load and mortality 
figures, it is an experiment we should try as a 
matter of priority. 

Crucial to such a response are mobile labs that 
can follow the disease control teams. My team 
has one. We have developed and tested a mobile 
molecular lab in other emerging infectious-dis- 
ease settings, including epidemic influenza. It 
is a small technical step to transfer the standard 
(fixed) lab assay for Ebola virus RNA to a portable thermocycler device. 

Colleagues have volunteered and we are set to test the mobile lab in 
the tropical conditions of northern Western Australia in the coming 
weeks. We hope to be working in West Africa by the end of the year. 

What will we bring home with us? Unlike clinical staff, we are 
unlikely to have to have much contact with actual patients. The 
blood samples we handle could be highly infectious, but I argue that 
researchers who understand the biology of the Ebola virus have a 
healthy respect for the occupational safety risks, and know how to 
mitigate those risks. Australia can therefore make an effective con- 
tribution to the control efforts with little risk to her own population. 

Past experience has shown me that effective infection control needs 
a strong grounding in science. Scientists are needed at the front line. 
The risk is worth it. = 


Tim Inglis is a medical microbiologist at the University of Western 
Australia in Nedlands. 
e-mail: tim.inglis@uwa.edu.au 
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Mercury’s ice isa 
recent arrival 


Ice at Mercury’s poles is a 
relatively new arrival —a 
finding that could help to 
resolve a debate about whether 
ice may have survived for 
billions of years on the planet 
closest to the Sun. 

Using data from NASA's 
MESSENGER spacecraft, 
Nancy Chabot of the Johns 
Hopkins University Applied 
Physics Laboratory in Laurel, 
Maryland, and her colleagues 
studied how light scattered 
inside dimly lit polar craters. 
They found that ina northern 
crater called Prokofiev, highly 
reflective ice drapes over 
the underlying topography. 
This suggests that the ice 
has appeared on the surface 
relatively recently. 

This ice was either delivered 
to Mercury, perhaps by 
comets, or churned up from 
below by impacts battering its 
surface, the authors conclude. 
Geology http://doi.org/wijf (2014) 


MYCOLOGY 


Teamwork helps 
yeast to infect 


Immune responses that should 
combat a disease caused by 
yeast instead make the fungus 
grow, potentially worsening the 
infection. 

Robin May at the University 
of Birmingham, UK, and his 
co-workers studied strains of 
Cryptoccocus gattii, which can 
cause meningitis and other 
problems. 

They found that oxygen- 
containing molecules produced 
by the host as a defence 
mechanism cause some of 
the fungal cells to become 
quiescent and resist immune 
responses. The remaining cells 
proliferate quickly, resulting 
in an overall increase in cell 


Selections from the 
scientific literature 


EVOLUTION 


Lizards adapt quickly to invaders 


Lizards in Florida have rapidly evolved traits 
that make them better tree-climbers, probably 
in response to an invasive competitor. 

Cuban brown anole lizards (Anolis sagrei) 
have spread over the past few decades across 
the southeastern United States, where they 
compete for territory and food with the native 
green anole (Anolis carolinensis; pictured). Yoel 
Stuart at the University of Texas, Austin, and 
his colleagues introduced small populations of 
the invader to three islands in a central Florida 


numbers. The resistant cells 
contained energy-producing 
organelles called mitochondria 
that were tubular in shape. 

The findings suggest that the 
fungal cells act as a team during 
infection, with non-dividing 
cells helping neighbouring ones 
to grow rapidly. 

Nature Commun. 5, 5194 (2014) 


Mysterious signals 
may be from Earth 


Radio pulses that look like they 
came from deep space could 
actually have earthly origins. 

A team led by Pascal 
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Saint-Hilaire at the University 
of California, Berkeley, 
detected five short but intense 
radio bursts at the Bleien Radio 
Observatory in Switzerland. 
This is only the second location 
at which such pulses have been 
detected, and their origins are 
still unclear. Characteristics 
of the radio waves suggest 
that they were stretched after 
passing through vast amounts 
of plasma — usually indicating 
an origin outside of the Milky 
Way, such as exploding stars in 
other galaxies. 

However, the pulses 
were detected only when 
the antenna was in a mode 
susceptible to ground 


lagoon and found that the native green anoles 
perched higher in trees than native lizards 
on nearby islands that had not been invaded. 
After just 20 generations of anoles, the team 
found that native lizards on invaded islands had 
evolved larger toepads, probably to help them 
cling to less-secure branches farther up in trees. 
Competition between closely related species 
can drive rapid, observable evolutionary 
change, the authors say. 
Science 346, 463-466 (2014) 


interference, and all but one 
pulse occurred in the late 
morning. This suggests that 
the signals could actually be 
coming from sources on Earth. 
Astrophys. J. 795, 19 (2014) 


Tumours linked to 
cellular rubbish 


Discarded rubbish from 
tumours could trigger nearby 
healthy cells to become 
malignant. 

Many cells shed exosomes: 
membrane-bound packages 
of proteins, DNA and RNA 
that are thought to bea 


GERRY ELLIS/MINDEN PICTURES/FLPA 


YONG-LAI ZHANG, HONG-BO SUN/JILIN UNIV. 
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waste-management system. 
Raghu Kalluri at the University 
of Texas MD Anderson 
Cancer Center in Houston 
and his colleagues found that 
exosomes from cancer cells 
contain the building blocks 
for short RNA fragments that 
can shut off gene expression. 
Healthy cells that were exposed 
to cancer exosomes in culture 
caused tumours when the 
cells were later injected into 
mice, whereas cells exposed to 
normal exosomes did not. 
Samples of exosomes from 
people with breast cancer also 
caused tumours in 5 out of 
11 mice treated. This finding 
could aid the discovery 
of markers for disease 
progression, or even represent a 
route to new cancer therapies. 
Cancer Cell http://doi.org/wkg 
(2014) 


Molecular sponges 
store oxygen 


An oxygen cylinder could hold 
even more gas if it were filled 
with sponge-like powders, 
chemists report. 

The powders are a type of 
metal-organic framework 
(MOF): sponge-like materials 
in which metal atoms are 
connected by organic groups, 
creating a porous network with 
many promising applications 
including gas storage. Omar 
Farha of Northwestern 
University in Evanston, 
Illinois, and his colleagues 
simulated oxygen adsorption 
(pictured, yellow spheres) on 
10,000 MOFs and selected two 
to test. They experimentally 
showed that the MOFs could 
store and release oxygen over 
50 cycles and outperformed 


zeolites, another kind of 
chemical sponge. 

The structures could allow 
soldiers or medical teams to 
carry oxygen using smaller, 
lighter containers that operate 
at lower pressures than 
cylinders, the team says. 
Angew. Chem. Int. Edn http://doi. 
org/f2vn85 (2014) 


GLACIOLOGY 


Channels hint at 
glacier hardiness 


Ancient channels preserved 
beneath the West Antarctic Ice 
Sheet suggest that part of the 
glacier prevailed during warm 
periods more than two million 
years ago. 

By combining radio-echo 
soundings of the landscape 
underneath the glacier with 
satellite images of the ice 
surface, Kathryn Rose of the 
Bristol Glaciology Centre, 

UK, Martin Siegert at Imperial 
College London and their 
colleagues discovered a series 
of ancient wide, shallow 
channels. These suggest that, 
historically, there was a large 
flow of meltwater from the 

ice surface. The most recent 
period that was warm enough 
to generate such melting was 
during the Pliocene, 5.3 million 
to 2.6 million years ago. 

Parts of this ice sheet, which 
some studies have predicted. 
will destabilize in a warmer 
climate, may have existed at 
intervals during a period that 
was 2°C warmer than now, the 
authors say. 

Geology http://doi.org/wkt 
(2014) 


Paper-based 
gene tools 


Functional biological circuits 
can be printed on paper, 
reports a team led by James 
Collins at Boston University in 
Massachusetts. 

The team synthesized 
cell-free gene networks from 
off-the-shelf parts and freeze- 
dried them on to paper. When 
later rehydrated, the networks 
worked as programmable 
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SOCIAL SELECTION 


Popular articles 
on social media 


Rules for reproducibility win support 


Nearly a decade after writing a scathing critique of biomedical 
research, ‘Why Most Published Research Findings Are False; 
Stanford University scientist John Ioannidis has published a 
follow-up. The health-policy researcher suggests a blueprint 
for making scientific results more reliable, including increasing 
the statistical certainty of discoveries, giving more weight to 
negative results and changing how researchers earn kudos. 
Many commenters chimed in with support for his paper, 
even if they did not believe that change could come easily. 
Simon Wheeler, a public-health nutritionist at the University 
of Cambridge, UK, endorsed Ioannidis'’s suggestions, tweeting 
that scientists should be “creating a culture where these are 
norms and expectations, not just lofty ideals”. Mick Watson, a 
computational biologist at the University of Edinburgh's Roslin 
Institute, UK, tweeted, “I’m totally with John Ioannidis when 
he says the scientific reward system needs to change.” 


PLoS Med. 11,e1001747 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 
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in vitro diagnostics with a 
colorimetric output. The team 
demonstrated the technology 
by using the circuits to detect 
glucose and to distinguish 
between RNA fragments from 
two related virus species of the 
Ebolavirus genus, called Sudan 
and Zaire. 

The paper technology 
should allow the easy and 
low-cost use of synthetic 
biology outside the lab, the 
team predicts. 

Cell http://doi.org/wkr (2014) 


Sunshine drives 
graphene machines 


Machines that move by 
bending in response to 
moisture can be made by 
exposing thin sheets of 
graphene oxide to sunlight. 
A team led by Hong-Bo 
Sun at Jilin University in 
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Changchun, China, focused 
sunlight on one side of 
graphene oxide paper. The 
ultraviolet radiation induced 
a reaction that negatively 
charged the surface so that it 
repelled water more strongly 
than the layer below, causing 
the paper to curl in seconds on 
contact with moisture. 

The team fashioned a 
claw from the paper that 
closes when approached by 
a sweaty finger, and a paper 
robot (pictured) that crawls 
when the humidity in its 
environment is raised and 
lowered. The authors say that 
the material could be used in 
devices including sensors and 
smart textiles. 
Adv. Mater. http://doi.org/wjj 
(2014) 
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China’s Moon shot 
China launched an uncrewed 
lunar orbiter on 24 October, 
from the Xichang launch centre 
in Sichuan province. The probe 
will slingshot around the Moon 
and is scheduled to land in 
northern China after an eight- 
day journey, becoming the 
nation’s first round-trip lunar 
mission. The mission aims to 
demonstrate heat-shield and 
guidance technologies for a 
planned 2017 lander, which 
will be China's first to bring 
samples back to Earth. 


Ebola transported 
Health officials in Mali 
reported on 24 October that 
the nation’s first confirmed 
person with Ebola — a two- 
year-old girl who had recently 
arrived from Guinea — had 
died. The girl began showing 
symptoms such as nosebleeds 
while still in Guinea, and 
started out for Mali with her 
grandmother on 19 October. 
Many opportunities for high- 
risk exposures occurred while 
the child travelled extensively 
by public transport, says the 
World Health Organization, 
which called the situation 

an emergency. An initial 
investigation has identified 

43 close, unprotected contacts, 
including 10 health-care 
workers. 


Hunt fraudsters 


The US government has 
charged two South Africans 
with conspiracy to sell places 
on illegal rhinoceros hunts 

by misleading US hunters. 
According to an indictment 
released on 23 October, 
brothers Dawie and Janneman 
Groenewald travelled 
throughout the United States 
between 2005 and 2010, 
charging people between 
US$3,500 and $15,000 to hunt 
‘problem rhinos in South 
Africa that they claimed 


The news in brief 


Skydiving hits new heights 


Alan Eustace, a senior vice-president at Google 
in Mountain View, California, broke the 

sound barrier and a world skydiving record on 
24 October, when he jumped from a reported 
altitude of 41.4 kilometres over Roswell, New 
Mexico. Eustace began his roughly 2.5-hour 
ascent into the upper stratosphere around 
dawn, being dragged skyward by a helium- 
filled scientific balloon. A pressurized space 


needed to be euthanized. The 
two were also charged with 
secretly trafficking rhino 
horns, and laundering their 
profits through bank accounts 
that they had set up for their 
business in Autauga County, 
Alabama. 


Funding reform 


China's government on 

21 October announced the 
passage of a reform plan 

to overhaul the funding 

of research in the country. 
Under the plan, the main 
competitive government grant 
schemes will be eliminated 

by 2017, including two core 
programmes that supported 
China's science and technology 
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development in recent decades. 
Funding will be reorganized 
into five programmes 
managed by a new science and 
technology agency. The move 
is part of an effort to reduce 
wastefulness and to consolidate 
grant management, currently 
distributed across various 
government departments. 

See pages 535 and 547 and 
go.nature.com/pzzvf5 

for more. 


Full text for free 
More than half of all peer- 
reviewed research articles 
published between 2007 and 
2012 are now free to download 
from the Internet, according to 
a 22 October report produced 


suit (pictured) designed by Paragon Space 
Development in Tuscon, Arizona, helped 
Eustace to breathe at high altitude and prevented 
fluids in his body from boiling. On his way 
down, Eustace reached an estimated peak speed 
of around 1,320 kilometres per hour. In October 
2012, Austrian parachutist Felix Baumgartner 
set the previous record-high skydive, jumping 
from an altitude of 39 kilometres. 


for the European Commission. 


By comparison, last year’s 
assessment found that papers 
from only one year (2011) had 
reached 50% free access. See 
go.nature.com/dbw76k 

for more. 


Diversity grants 

The US National Institutes of 
Health (NIH) has awarded 
US$31 million in grants aimed 
at encouraging minorities 
and under-represented 
groups to pursue careers in 
biomedical fields, the agency 
announced on 22 October. 
The grants target institutions 
that receive relatively little 
NIH funding and enrol a high 
proportion of students from 
low-income backgrounds. 
The funds will support 


PARAGON SPACE DEVELOPMENT CORPORATION 


NASA/JPL/UNIV. ARIZONA 


SOURCE: WHO 


ways to improve access to 
research opportunities for 
undergraduates, build a 
national mentoring network, 
and assess the results of the 
initiatives themselves to 
understand which strategies 
bring lasting change. 


EU climate goals 


European Union (EU) member 
states have agreed to reduce 
the region’s greenhouse-gas 
emissions by at least 40% by 
2030, relative to 1990 levels. 
The EU's new climate and 
energy policy framework, 
released on 24 October, 
includes a politically binding 
target of getting 27% of energy 
from renewable sources by 
2030; and a goal of improving 
overall energy efficiency by 
27%, to be reviewed in 2020 
with the hope of increasing the 
latter target to 30%. See Nature 
http://doi.org/wmb (2014) 

for more. 


Commission vote 


The European Parliament 
approved a new European 
Commission on 22 October, 
which will begin a five- 

year term on 1 November. 
Portuguese economist 

Carlos Moedas will serve as 
commissioner for research, 
science and innovation, while 
Spain’s Miguel Arias Canete, a 
lawyer, will be commissioner 
for energy and climate 
change. See go.nature.com/ 
t73tfi for more. 


TREND WATCH 


Comet close-up 
NASAs Mars Reconnaissance 
Orbiter snapped comet C/2013 
Al Siding Spring (pictured) 

as it buzzed past the red planet 
on 19 October. The image, 
released last week, is the first 
ever taken of the nucleus ofa 
comet from the Oort Cloud, 
the reservoir of icy bodies 

that lies far beyond Pluto. 

The nucleus of Siding Spring 
turned out to be smaller 

than expected, measuring 

less than 500 metres across. 
Siding Spring passed just 
138,000 kilometres from the 
Martian surface. See go.nature. 
com/psvju9 for more. 


Ebola vaccines 

On 24 October, the World 
Health Organization 

(WHO) announced plans to 
produce millions of doses of 
experimental Ebola vaccines 
by the end of 2015, with 
hundreds of thousands of 
doses available by June. The 
outbreak has infected more 
than 10,000 people and killed 
around 5,000, the WHO 
estimates. Phase I trials of two 


candidate vaccines have begun, 
and the first phase II trials 

are set to start in Liberia in 
December and in Sierra Leone 
in January. See go.nature.com/ 
pj71hi for more. 


First light 


The National Synchrotron 
Light Source I, a high-energy 
photon source at Brookhaven 
National Laboratory in Upton, 
New York, produced its first 
photons on 23 October — a 
major milestone towards 
achieving full research 
capabilities. Thirty beamlines 
are under development around 
the 792-metre electron- 
storage ring, where specialized 
magnets coax the electrons 
into producing intense beams 
of X-rays. These can be guided 
into experimental stations 

and used to probe matter. The 
first experiments are expected 
before the end of this year. 


FACILITIES 


Cancer centre 

The University of Oxford, 
UK, announced plans on 

23 October to establish the 
Precision Cancer Medicine 
Institute, a £110-million 
(US$177-million) research 
facility funded by charitable 
and commercial partners and 
a £35-million government 
grant. The institute will test 
precision treatments such 
as proton-beam therapy for 
patients with early-stage 


TUBERCULOSIS TOLL REVISED UPWARDS 


The World Health Organization estimates that 9 million people 


Anestimated 9 million people 
developed tuberculosis (TB) 

last year, the World Health 
Organization (WHO) said ina 

22 October annual update (see 
go.nature.com/rohvhp). Although 
rates of new cases and deaths are 
thought to be falling, this figure is 
400,000 more than that estimated 
for the previous year, because 
improved measurements enable 
the WHO to be more confident 
in its figures. The agency said 
that annual funding for TB 
prevention needs to rise by some 
US$2 billion, to $8 billion. 


developed tuberculosis (TB) last year. 
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SEVEN DAYS | THIS WEEK | 


5-8 NOVEMBER 

The annual meeting 

of the Society of 
Vertebrate Paleontology 
takes place in Berlin, 
where scientists will 
discuss vertebrate 
origins, evolution and 
functional morphology. 
go.nature.com/hmmzkm 


6-9 NOVEMBER 
At the International 
Conference of Crisis 
Mappers in New York 
City, aid workers, 
technologists and 
policy-makers will 
discuss best practices 
and recent innovations 
in humanitarian 
technology. 
go.nature.com/j19byw 


cancers that currently have a 
poor prognosis. The centre is 
also expected to house a new 
research partnership with the 
non-profit Chan Soon-Shiong 
Institute of Molecular Medicine 
in Culver City, California, 
which is intended to target 
treatments more precisely 
using large amounts of genetic, 
molecular and clinical data. 


AWARDS 


Maddox prize 

The John Maddox Prize for 
Standing up for Science was 
awarded on 27 October to 
physicist David Robert Grimes, 
a postdoc and journalist at 

the University of Oxford, UK, 
and US freelance journalist 
Emily Willingham. The award 
honours people who promote 
science in the public interest, 
often despite obstacles. Named 
after a former editor of Nature, 
the prize is jointly awarded by 
Nature, the Kohn Foundation 
in London and the UK 
campaign group Sense About 
Science. See go.nature.com/ 
rbjzsg for more. 
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Record amounts of gas have spewed from Iceland’s Holuhraun plain. 


Icelandic volcano 
stuns scientists 


Sulphur -rich eruption defies preparations for an ashy blast. 


BY ALEXANDRA WITZE IN REYKJAVIK 


tinged blood red, of late. Above the mari- 
time bustle of Reykjavik’s harbour and the 
city’s towering concrete Hallgrimskirkja church, 
volcanic pollution gives the skies an eerie glow. 
For eight weeks, lava has been spurting out 
of a fissure in the ground radiating from the 
Bardarbunga volcano, about 250 kilometres 
from Reykjavik. Sulphur dioxide has been 
spurting too — 35,000 tonnes ofita day, more 
than twice the amount spewing from all of 
Europe's smokestacks. The gas has spread 


| celandic sunrises and sunsets have been 


across the Icelandic countryside, causing 
people to wheeze and trapping some indoors. 

The record-setting amount of pollution has 
surprised even volcanologists in the middle 
of a major project funded by the European 
Union to understand the island’s fiery activ- 
ity. They had been preparing for a repeat of 
the 2010 Eyjafjallajokull eruption, which led 
to a billowing ash plume that grounded planes 
across Europe. “Everybody was expecting a big 
ash cloud, and now we have something totally 
different,’ says Anja Schmidt, an atmospheric 
modeller at the University of Leeds, UK, who 
studies how volcanic gases spread. 


The timing of the eruption was just about 
perfect for the project, called FUTUREVOLC. 
The initiative aims to use Iceland as a natural 
laboratory to understand how magma makes 
its way from deep in Earth's crust to the sur- 
face — to do so, its organizers have focused on 
four of Iceland’s most active volcanoes, one of 
which is Bardarbunga. The researchers used 
extra seismometers and global-positioning- 
system (GPS) stations to bolster the moni- 
toring network maintained by the Icelandic 
Meteorological Office and the University of 
Iceland, both in Reykjavik. And they used 
these to measure the events leading up to and 
after the eruption with unprecedented detail. 


THE LONG MARCH 

Earthquakes began shaking Bardarbunga on 
16 August. For two weeks, researchers watched 
as seismic activity marched north and east, 
towards the edge of the ice cap that covers the 
volcano. GPS stations measured the ground 
flexing upward as huge amounts of magma 
shifted underground. “The seismic shows us 
the detail, and the GPS shows us the volume,’ 
says Kristin Vogfj6r6, a seismologist at the Ice- 
landic Meteorological Office and co-leader of 
FUTUREVOLC. 

For 45 kilometres, the magma crept along, 
cooling and forming an underground sheet 
known as a dike. By 29 August, it had made it 
to the edge of the ice cap and begun erupting 
into a barren plain called Holuhraun. 

Since then, the eruption has spewed at least 
half a cubic kilometre of lava, making it the 
largest lava-producing eruption in Iceland 
since 1947. But the quantity of gas is what has 
startled scientists most. Chunks of rock col- 
lected from the eruption show how gas-rich 
the lava is; the rocks are porous, filled with air 
pockets where the gas has leaked out. 

With the right winds, the sulphur that 
Holuhraun produces can reach as far as 
the European continent, where Austria has 
recorded more sulphur in its air than any time 
since the industrial clean-up of the 1980s. 

FUTUREVOLC has placed some basic 
gas-monitoring equipment at its volcanoes, 
including two types of spectrometer at the 
Holuhraun site. One measures concentrations 
of SO, by studying how it absorbs particular 
wavelengths of sunlight. The other scans for 
multiple gasses simultaneously. 

Both types of instrument rely on daylight 
to make their measurements, and the 
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> encroaching darkness of the high northern 
winter will limit their usefulness, says Sara 
Barsotti, an atmospheric physicist at the Ice- 
landic Meteorological Office. (Even getting to 
the eruption site, which is remote, is getting 
harder as the winter sets in.) Researchers will 
try to continue making as many gas measure- 
ments on the ground as possible, and compare 
those to the rough estimates made by satellites, 
Barsotti says. 

Icelandic officials are trying to work out how 
much ofa hazard the sulphur is and whether 
they can predict its movement more accurately. 
The meteorological office has begun issuing 
forecasts of where the gas is likely to travel 
each day. Sulphur spikes as high as 21,000 


micrograms per cubic metre were measured 
last weekend in the town of H6fn; the World 
Health Organization recommends no more 
than 500 micrograms per cubic metre for a 
10-minute exposure. 

No one has died in the eruption, and the 
plume is not high enough to penetrate the 
stratosphere and cause widespread climate 
perturbations. But the million tonnes of sul- 
phur emitted so far are an unprecedented 
experiment in testing the effects of toxic-gas 
exposure, Barsotti says. 

Lessons from Iceland may prove useful in 
understanding long-term gas exposure in 
other volcanic regions, such as Japan, Indone- 
sia and Hawaii. In the early 2000s, residents 


around the Miyake-jima volcano in Japan were 
evacuated when it began erupting with roughly 
the same level of sulphur emissions. 

In Iceland, the last similar event was a fis- 
sure eruption known as the Krafla fires that 
began in 1975 and lasted on and off until 1984, 
says Freysteinn Sigmundsson, a volcanologist 
at the University of Iceland and co-leader of 
FUTUREVOLC. Ifthe current eruption is tap- 
ping magma deep in the crust, as the lava's vol- 
ume and chemistry suggest, then it, too, may 
continue for months or even years. 

“This eruption comes at a good time for the 
project,’ Sigmundsson says, waving out his 
office window at Reykjavik’s red skies. “We 
don't see an end in sight.” m 


‘Forgotten’ NIH smallpox virus 
languishes on death row 


World Health Organization lacks resources to witness destruction of stocks. 


BY SARA REARDON 


US National Institutes of Health (NIH) 

in Bethesda, Maryland, in June are 
marked for death by autoclave — yet 
they linger in a high-security freezer. 
Chalk it up to bureaucracy: an interna- 
tional agreement requires that the World 
Health Organization (WHO) witnesses 
the destruction of the samples, but the 
agency is overwhelmed by the Ebola out- 
break in West Africa. 

The delay adds another twist to the strange 
tale of the vials, which sat forgotten for six 
decades in an unmarked cardboard box. It 
also mirrors the fierce debate over whether to 
destroy the two other known stocks of small- 
pox in the world. That decision is on hold 
while the WHO determines whether small- 
pox virus could be synthesized from scratch 
by those with nefarious intent, and whether 
virus stocks should therefore be preserved 
to help develop countermeasures against a 
terrorist attack. 

The NIH smallpox was discovered in an 
agency ‘cold room used by the US Food and 
Drug Administration; soon after, the vials were 
shipped to the US Centers for Disease Control 
and Prevention (CDC) in Atlanta, Georgia. 
That agency, and its Russian counterpart — 
VECTOR, in Koltsovo — are the only two 
laboratories authorized to work with smallpox 
under an international agreement that also calls 
for the eventual destruction of all stocks. (The 


\ ix vials of smallpox virus discovered at the 


Research stocks of smallpox virus are limited. 


virus was eradicated in the wild in 1979.) 

The CDC promised to destroy the NIH 
samples immediately, with WHO officials 
present. But that has proved more difficult than 
anticipated. The WHO has dealt with just one 
other surprise smallpox discovery: viral DNA 
found in a South African laboratory in 2013. 
WHO officials were present when that DNA 
was destroyed in January this year — but no 
WHO employee is certified to enter the CDC’s 
high-security smallpox lab. This means that 
a WHO official must fly to Atlanta to witness 
the destruction of the virus on closed-circuit 
television. Arranging the trip has been made 
more difficult by the Ebola crisis, says Alejan- 
dro Costa, head of the WHO team in Geneva, 
Switzerland, that monitors smallpox issues. 
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The authorized stocks of smallpox at the 
CDC and VECTOR are also in limbo. For more 
than 20 years, the US and Russian govern- 
ments have fought efforts to set a deadline 

to destroy the samples. They argue that 
the virus should be preserved to test 
vaccines and antivirals that might be 
needed to respond to an accidental 
smallpox release or a terrorist attack. 
In May, the WHO’s policy-making 
body, the World Health Assembly, again 
put offa decision on the fate of the smallpox 
stocks. Unusually, the group did not say when 
it might revisit the issue. 

Virologist D. A. Henderson, who led the 
WHO’s programme to eradicate smallpox, 
argues that all stocks should be destroyed, ina 
kind of mutual disarmament. “Anyone found 
with isolates of smallpox virus after point X 
would be guilty of crimes against humanity, 
says Henderson, nowat the Center for Health 
Security in Baltimore, Maryland. 

At a meeting last week, members of the 
WHO Advisory Committee on Variola Virus 
Research agreed to commission a report assess- 
ing the risk that a bioterrorist could synthesize 
smallpox — a situation that might add weight 
to the argument for keeping the natural virus. 
Costa says that the WHO will postpone any 
decision on stock destruction until the report 
is released, in 2016 at the earliest. 

The NIH smallpox is likely to be destroyed 
well before then: Costa estimates by January 
2015. In the meantime, experts say that the 
virus is safe at the CDC. m 
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Coastal havoc boosts jellies 


Five-year Chinese study suggests that human activity made gelatinous outbreaks worse. 


BY JANE QIU IN QINGDAO 


marine ecologist Sun Song, recalling a 

jellyfish outbreak last year in the Yellow 
Sea off China's eastern coast. “The slimy mon- 
sters were everywhere, their long tentacles flut- 
tering ferociously in the rolling waves.’ Such 
blooms have repeatedly choked Chinese waters 
in the past decade, posing substantial threats 
to tourism, fisheries and coastal facilities such 
as chemical plants and nuclear power stations. 

Jellyfish blooms have probably struck the 
world’s seas periodically for hundreds of 
millions of years. But researchers suspect 
that human activity has made them worse, 
with knock-on effects on marine ecosys- 
tems and especially on fish, because jelly- 
fish compete with fish for food and feed on 
their larvae. Now, one of the largest stud- 
ies of jellyfish blooms so far — involving 
laboratory testing, ocean surveys and field 
experiments — has found direct evidence 
linking the proliferation of jellyfish to human 
disturbances along the coast, especially the 
disruption of sea-floor ecosystems. 

The five-year, US$4.5-million initia- 
tive started work in 2010 led by Sun, who is 
director of the Chinese Academy of Sciences’ 
Institute of Oceanology in the coastal city of 
Qingdao. Sun's team, involving eight institu- 
tions and more than 100 researchers, presented 
its latest results earlier this month at the World 
Conference on Marine Biodiversity, also in 
Qingdao. China's waters are not the only ones 
to have had jellyfish outbreaks in recent years, 
so the results “could help to tackle similar 
problems in other parts of the world” such 
as the Mediterranean or the Caspian Sea, 
says Mark Costello, a marine ecologist at the 
University of Auckland in New Zealand. 

The survey shows that Chinese coasts are 
plagued primarily by three jellyfish species: 
Aurelia aurita, Cyanea nozaki and the most 
damaging one, the giant Nemopilema nomu- 
rai, which can grow to 2 metres in diameter 
and weigh as much as 200 kilograms. 

Jellyfish are most recognizable in the short- 
lived ‘medusa’ stage of their life cycles, when 
they swim freely and reproduce sexually. But 
they also go through a stage in which they are 
bound to the sea floor as ‘polyps. These can 
live for years and reproduce asexually, budding 
new medusae. Most research so far has focused 
on the medusa stage, but “the key to jellyfish 
outbreaks may lie in the polyps’, says Sun. 

Polyps are notoriously difficult to find and 


cc E was a truly gelatinous world,’ says 


A polyp of Nemopilema nomurai, 1 millimetre long, will give rise to medusae more than 2 metres across. 


collect: they are transparent and easy to break, 
and can be just micrometres long. “Numerous 
attempts by scuba diving, underwater robots 
and dredging up the sediments have led to noth- 
ing,’ Sun says. So the team turned to the jelly- 
fish in the oceanology institute's aquarium. The 
researchers grew polyps on hard surfaces such 
as glass, plastic or stone in the laboratory and 
then lowered the slabs onto the seabed at halfa 
dozen coastal locations, to see if specific envi- 
ronmental factors would spur a mini-outbreak. 

As Sun reported at the conference, polyps of 
all three species tended to sprout medusae when 
sea-bottom temperatures were about 10-15 °C. 
The blooms were greatly enhanced by extreme 
temperature swings. Moreover, more medusae 
formed in waters that had fewer bottom dwell- 
ers, or ‘benthos’ — animals such as starfish, sea 
slugs, shrimp and fish — and greater environ- 
mental disturbance. Nutrient overloads from 
sewage and agricultural run-off, and construc- 
tion of infrastructure such as ports and bridges, 
seemed to help the jellyfish to thrive. 

The findings point to a link between the 
gelatinous scourges and coastal disturbances in 
China caused by overfishing, bottom trawling 
and oxygen depletion following algal blooms, 
as well as coastal development and increasingly 
frequent extreme-weather events, says Cos- 
tello. Moreover, a decline in both the number 
and the diversity of benthos in the past 50 years 
may have also contributed, says Xu Kuidong, a 
marine ecologist at the Institute of Oceanology. 
Benthos that feed on polyps “are the key biologi- 
cal control of jellyfish populations’, he says. 


The ocean currents that skirt China’s coast 
can affect benthos by changing temperatures 
and nutrient concentrations, so a broader 
approach will be necessary to unveil the true 
causes of jellyfish outbreaks, says Sun. In 
April, he and his colleagues launched a five- 
year, $165-million project to study ocean 
currents. By measuring the motion of currents 
at depths of up to 6,000 metres, and learning 
how ecosystems respond, the researchers hope 
to piece together how local and regional factors 
combine to cause blooms. “This might allow us 
to forecast — and even prevent — a forthcom- 
ing outbreak,” says Sun. 

And in May, an international collabora- 
tion launched the Jellyfish Database Initiative 
(JeDI), which will provide global, open-access 
data on jellyfish abundances and associated 
environmental factors. Such projects — as 
well as more-detailed studies of polyps — will 
attempt to improve understanding of outbreaks 
and the contribution of natural variations and 
human factors to changes in jelly populations 
around the world, says Cathy Lucas, a marine 
biologist at the University of Southampton, UK, 
who contributes to JeDI. 

Sun says that increasing rates of jellyfish 
outbreaks in Chinese waters might be an 
indicator of worsening ecosystem health. 
Over the past 500 million years, the creatures 
have survived countless climatic and environ- 
mental upheavals — including all five known 
mass-extinction events, he points out. “When 
ecosystems deteriorate, they are likely to thrive 
when others fail.” m 
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Large dogs like this Old English Sheepdog are promising test subjects for longevity studies. 


Pet dogs set to test 
anti-ageing drug 


Trials would study extension of lifespan in domestic setting. 


BY ERIKA CHECK HAYDEN 


east, worms and mice: all have lived 
VY iene: when treated with various 
chemical compounds in laboratory 
tests. But many promising leads have failed 
when tried in humans. This week, researchers 
are proposing a different approach to animal 
testing of life-extending drugs: trials in pet 
dogs. Their target is rapamycin, which is used 
clinically as part of an anti-rejection drug 
cocktail after kidney transplants and which 
has also been shown to extend the lives of 
mice by 13% in females and 9% in males (D. E. 
Harrison et al. Nature 460, 392-395; 2009). 
The compound’ effect on lifespan has not yet 
been tested in people — human trials are expen- 
sive and it takes a long time to learn whether 
a drug can extend a human life. Furthermore, 
rapamycin is no longer patentable, so pharma- 
ceutical companies are unwilling to invest effort 
in it. The drug can also cause some serious side 


2 
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VIDEO OF THE WEEK 


Oil from Deep 
Horizon spill 
settled on more 
of the sea floor 
than expected 
go.nature.com/ 
urmych 


effects that might rule it out as a pharmaceutical 
fountain of youth. It has, for example, been 
linked to an increased risk of diabetes in people 
who have had kidney transplants (O. Johnston 
et al. J. Am. Soc. Nephrol. 19, 1411-1418; 2008). 
But at low doses, researchers suspect that the 
drug will not be a problem for healthy dogs. 

“Any compound has side effects of some 
sort; the question is what kind of side effects 
one can live with,” says Randy Strong, a geron- 
tologist at the Barshop Institute for Longevity 
and Aging Studies in San Antonio, Texas. 

To answer that question, molecular biologists 
Matthew Kaeberlein and Daniel Promislow 
at the University of Washington in Seattle pro- 
pose to give low doses of rapamycin to dogs in 
a study that would also test whether the drug 
can extend the animals lives. The pair invited 
experts in canine health and the biology of age- 
ing to a meeting in Seattle on 28-29 October to 
discuss how to structure such a trial. 

The researchers hope to test rapamycin in 
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large dogs that typically live for eight to ten 
years; they would start giving the drug to ani- 
mals aged six to nine. A pilot trial would involve 
about 30 dogs, half of which would receive the 
drug, and would allow the researchers to dose 
the dogs for a short time and observe effects 
on heart function and some other health meas- 
ures. The trial could be completed in as little 
as three years, but researchers will know long 
before that — perhaps in months — whether 
rapamycin improves cardiac function or other 
aspects of health. 

Rapamycin acts on a protein that is involved 
in cell growth, but little is known about how it 
extends life. It might retard the ageing process 
itself or it might prevent age-related diseases. 
One hypothesis is that it works primarily by 
preventing the development of cancers. 

Pet dogs should provide a more realistic test 
than lab mice of how the drug would work in 
humans. Pets experience some of the same 
environmental influences and get some of the 
same age-related diseases as their masters, says 
Kaeberlein. (He plans to enrol his own German 
shepherd dog when it is old enough.) 

Other researchers say that Kaeberlein and 
Promislow’s reasoning makes sense. “We're talk- 
ing about whether aged pets will benefit, and 
that’s a good model for a human population,” 
says physiological geneticist David Harrison of 
the Jackson Laboratory in Bar Harbor, Maine, 
who has studied rapamycin in mice. 

Kaeberlein and Promislow have collected 
about US$200,000 in University of Washing- 
ton institutional funds for the pilot. But they 
will need much more funding for a larger trial of 
several hundred dogs to test whether the drug, 
given over years, can extend lifespan — and to 
study the normal ageing process in thousands of 
animals to try and understand the mechanism 
of any life-extending effects. 

Participants at the Seattle meeting are also 
considering whether pet owners could be asked 
to help fund a bigger trial and if so, whether 
researchers could promise donors that their 
dogs would be treated with rapamycin rather 
than a placebo without compromising the study. 
Usually, participants in a gold-standard clini- 
cal trial are randomly assigned to a treatment 
or control group. 

“Tt’s a kind of unique situation because they 
are people’s pets,” says Kaeberlein. “We haven't 
come to any decision on how that’s going to be 
handled” m 
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China opens translational 
medicine centre in Shanghai 


First of five linked institutes aims to capitalize on basic-research investments. 


BY DAVID CYRANOSKI 


ver the past decade, China has poured 
() money into research, especially in the 

biomedical sciences. But as the nation’s 
health-care costs have risen in the past few 
years, critics have argued that the investment 
has not paid off. A group of researchers and gov- 
ernment officials now hopes to improve those 
returns with the official opening this month of 
the National Centre for Translational Medicine 
in Shanghai. The 1-billion-renminbi (US$163- 
million) centre, slated to complete construction 
in 2017, is the first of five institutions meant to 
bridge the gap between basic research and clini- 
cal application by putting researchers, doctors 
and patients under one roof. 

China's biologists have made impressive 
progress in fields such as genome sequencing 
and protein-structure analysis, but have pro- 
duced little in terms of drugs or other medi- 
cal products. “Bloggers and others are always 
complaining that China is just burning money,’ 
says Xiao-Fan Wang, a cancer researcher at 
Duke University in Durham, North Carolina, 
and one of the 21 people on the centre’s inter- 
national consulting committee. Some even ask 
why China, which has made progress in indus- 
try largely by copying other countries, has not 
succeeded by following the same strategy in the 
biomedical sciences, says Wang. 

Wang says that doctors in China are over- 
worked, often racing through whole days of 
back-to-back patient visits and procedures. 
And yet, because most Chinese hospitals are 
part of universities, the doctors must publish 
to get promoted. In this competitive environ- 
ment, they often refuse to share data, but are 
rarely able to do thorough research themselves. 
“You can’t expect them to do eight hours of sur- 
gery and then jump into a lab coat,’ says Wang. 
The Shanghai centre will change that, he says, 
by giving clinicians time to focus on research 
questions. 

Saijuan Chen, the centre’s director and a 
geneticist at Shanghai Jiaotong University, says 
that the centre began interviews this month to 
recruit some 50 principal investigators and 
12 scientists to direct research in their disci- 
plines. She says that the institution's focus will 
be on developing treatments for heart disease, 
stroke, metabolic diseases and cancer. 

The centre’s international consulting 
committee will help to recruit top talent in these 


A major push is under way in China to develop new drugs and surgical procedures. 


fields. Tak Mak, a committee member and an 
immunologist at the University of Toronto in 
Canada, says that the strategy shows a commit- 
ment to hiring on the basis of expertise rather 
than connections. It is “an effort to break out of 
the old system — which is probably the system 
in more places than youd care to know’, he says. 


FINE BALANCE 

One challenge will be attracting Chinese 
biomedical researchers who are working 
abroad. Wang says that China has been able 
to lure basic-research scientists back to the 
country with competitive pay. But clinicians 
in China receive a fraction of the salaries they 
can get in the United States. 

Managing the centre may also be difficult, 
says committee member Sujuan Ba, chief oper- 
ating officer of the US National Foundation for 
Cancer Research in Bethesda, Maryland. The 
centre's governing council comprises members 
from ten government agencies and institutions. 
“That shows the wide scope of support from 
China, but it is going to be a huge challenge for 
the leadership team to balance and to manage 
each council member's requisites,” she says. “It 
is very important for the centre to steer away 
from bureaucratic red tape and stay focused on 
its mission and long-term vision of conducting 
high-impact translational research.” 

The 54,000-square-metre site will have 
300 beds for patients and study volunteers. 


It will also run a biobank that will collect 
hundreds of thousands of patient samples, 
and an ‘omics’ centre that will conduct high- 
throughput genome analyses and gather data 
on proteins and metabolic products. 

Haematologist Zhu Chen of Shanghai Jiao 
Tong University, who is China's former health 
minister and chairman of the centre's scien- 
tific advisory board, hopes that the centre will 
emulate the spirit of St Jude Children’s Research 
Hospital in Memphis, Tennessee, with its close 
relationship between clinicians and basic 
researchers. He also emphasizes the need to 
make therapeutic trials free: in China, partici- 
pants are often charged for treatment. Zhu and 
Saijuan Chen, who are married, led one of the 
few successful translational research projects in 
China — treatment ofa form of leukaemia using 
retinoic acid and arsenic trioxide. 

The Shanghai centre will eventually have 
four sister institutions: a geriatrics centre at 
the People’s Liberation Army general hospital 
in Beijing, a centre for rare and refractory dis- 
eases at the Peking Union Medical College in 
Beijing, a molecular-medicine research centre 
at the Fourth Military Medical University in 
Xian and a regenerative-medicine centre at the 
West China Hospital in Chengdu. 

“These centres are at the historic moment to 
make a huge impact for China’s drug develop- 
ment,’ says Ba. “We should be able to see the 
signs of success in five years, if not earlier: m 
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Geneticists tap human knockouts 


Sequenced genomes reveal mutations that disable single genes and can point to new drugs. 


BY EWEN CALLAWAY 


function by inactivating the gene in ques- 
tion in mice and other lab animals, and 
then observing how it affects the organism. Now 
researchers studying such gene ‘knockouts’ have 
another, ideal model at their disposal: humans. 

The approach does not involve genetically 
engineering mutant people in the lab, as is 
done in mice. Instead, researchers scan the 
genomes of thousands or millions of people, 
looking for naturally occurring mutations 
that inactivate a particular gene. By observing 
how these mutations affect health, researchers 
hope to gain insight into basic biology and to 
unearth new disease treatments. 

Geneticists discussed several such large-scale 
efforts during a packed session at the Ameri- 
can Society of Human Genetics meeting in San 
Diego, California, last week. “So much of what 
we know is based on mice and rats, and not 
humans,’ says Daniel MacArthur, a genomicist 
at Massachusetts General Hospital in Boston, 
whose team identified around 200,000 varia- 
tions that naturally knocked out genes by trawl- 
ing the protein-coding portion of the genome, 
or exome, in more than 90,000 people. “Now 
we can find people who actually have a particu- 
lar gene inactivated or somehow modified, and 
that allows us to test hypotheses directly.’ 

On average, every person carries mutations 
that inactivate at least one copy of 200 or so 
genes and both copies of around 20 genes. 
However, knockout mutations in any particu- 
lar gene are rare, so very large populations are 
needed to study their effects. These ‘loss of 
function mutations have long been implicated 
in certain debilitating diseases, such as cystic 
fibrosis. Most, however, seem to be harmless — 
and some are even beneficial to the persons 
carrying them. “These are people we're not 
going to find ina clinic, but they’re still really 
informative in biology,’ says MacArthur. 

His group and others had been focusing on 
genome data, but they are now also starting to 
mine patient-health records to determine the 
— sometimes subtle — effects of the mutations. 
Ina study of more than 36,000 Finnish people, 
published in July (E. T: Lim et al. PLoS Genet. 10, 
e1004494; 2014), a team that included MacAr- 
thur discovered that people lacking a gene called 
LPA might be protected from heart disease, and 
that another knockout mutation, carried in one 
copy ofa gene by up to 2.4% of Finns, may cause 
fetuses to miscarry ifit is present in both copies. 

Bing Yu of the University of Texas Health 


Be decades, biologists have studied gene 


AL, 


Finland offers geneticists a rich seam of variation. 


Science Center in Houston told the meeting 
how she and her collaborators had compared 
knockout mutations found in more than 
1,300 people with measurements of around 
300 molecules in their blood. The team found 
that mutations in one gene, called SLCO1B1, 
were linked to high levels of fatty acids, a known 
risk factor for heart failure. And a team from 
the Wellcome Trust Sanger Institute in Hinxton, 
UK, reported that 43 genes whose inactivation 
is lethal to mice were found to be inactivated in 
humans who are alive and apparently well. 


POSTER CHILD 

Following up on such insights will help 
researchers to unpick the functions of the 
thousands of human genes about which little 
or nothing is known, say MacArthur and oth- 
ers. It might even aid drug discovery by iden- 
tifying genes or biological pathways that could 
protect against disease. 

The poster child for human-knockout efforts 
is a new class of drugs that block a gene known 
as PCSK9 (see Nature 496, 152-155; 2013). The 
gene was discovered in French families with 
extremely high cholesterol levels in the early 
2000s. But researchers soon found that people 
with rare mutations that inactivate one copy of 
PCSK9 have low cholesterol and rarely develop 
heart disease. The first PCSK9-blocking drugs 
should hit pharmacies next year, with manufac- 
turers jostling for a share of a market that could 
reach US$25 billion in five years. 

“T think there are hundreds more stories 
like PCSK9 out there, maybe even thousands,” 


in which a drug can mimic an advantageous 
loss-of-function mutation, says Eric Topol, 
director of the Scripps Translational Science 
Institute in La Jolla, California. Mark Gerstein, 
a bioinformatician at Yale University in New 
Haven, Connecticut, predicts that human 
knockouts will be especially useful for identi- 
fying drugs that treat diseases of ageing. “You 
could imagine there's a gene that is beneficial to 
you as a 25-year-old, but the thing is not doing 
a good job for you when you're 75.” 

The human-knockout data presented last 
week will also make it easier to interpret the 
growing number of genomes being sequenced 
from people seeking treatment, says Nazneen 
Rahman, a medical geneticist at the Institute 
for Cancer Research in London. Rahman 
told the meeting about her team’s analysis of 
knockout mutations in 1,000 British people. 
“The take-home message is that these types of 
mutations occur much more commonly than 
people thought,’ she says. 

When a person's genome is sequenced to 
identify the cause of a mysterious illness, any 
knockout mutations that turn up are obvious 
suspects. Having a complete list of such muta- 
tions and their health effects (or lack thereof) 
should help to identify the true causes of a dis- 
ease, Rahman says. 

To that end, MacArthur’s team last week 
released knockout and other data from some 
63,000 people — and others are already mak- 
ing use of this trove. “One of the first things I 
did when they released the data was to look at 
all my favourite genes for very severe diseases, 
to see if there are people in those databases — 
and there are,” says John Belmont, a medical 
geneticist at Baylor College of Medicine in 
Houston. He found 11 people with mutations 
associated with Marfan syndrome, a disorder 
that affects connective tissue and that can 
cause heart problems ifleft untreated. 

However, these people may be silent cases, 
they might somehow be protected against the 
disease or their genomes might have been 
sequenced incorrectly. People who carry dis- 
ease-causing mutations but don't get sick are 
especially intriguing, he says. “We should pay 
attention to them,” Belmont says. “They may 
hold a key for new treatments.” m 


CORRECTION 

The News Feature ‘The ethics squad’ 
(Nature 514, 418-420; 2014) misspelled 
Susan Kornetsky’s name. 
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THE TO 
100 
PAPERS 


Nature explores the most -cited research of all time. 


BY RICHARD VAN NOORDEN, 
BRENDAN MAHER AND REGINA NUZZO 


he discovery of high-temperature 

superconductors, the determina- 

tion of DNAs double-helix struc- 

ture, the first observations that the 
expansion of the Universe is accelerating — all 
of these breakthroughs won Nobel prizes and 
international acclaim. Yet none of the papers 
that announced them comes anywhere close 
to ranking among the 100 most highly cited 
papers of all time. 

Citations, in which one paper refers to ear- 
lier works, are the standard means by which 
authors acknowledge the source of their meth- 
ods, ideas and findings, and are often used as a 
rough measure of a paper’s importance. Fifty 
years ago, Eugene Garfield published the Sci- 
ence Citation Index (SCI), the first systematic 
effort to track citations in the scientific litera- 
ture. To mark the anniversary, Nature asked 
Thomson Reuters, which now owns the SCI, 
to list the 100 most highly cited papers of all 
time. (See the full list at www.nature.com/ 
top100.) The search covered all of Thomson 
Reuter’s Web of Science, an online version of 
the SCI that also includes databases covering 
the social sciences, arts and humanities, con- 
ference proceedings and some books. It lists 
papers published from 1900 to the present day. 

The exercise revealed some surprises, not 
least that it takes a staggering 12,119 citations 
to rank in the top 100 — and that many of the 
world’s most famous papers do not make the 
cut. A few that do, such as the first observation! 


of carbon nanotubes (number 36) are indeed 
classic discoveries. But the vast majority 
describe experimental methods or software 
that have become essential in their fields. 

The most cited work in history, for example, 
isa 1951 paper’ describing an assay to deter- 
mine the amount of protein in a solution. It 
has now gathered more than 305,000 cita- 
tions — a recognition that always puzzled 
its lead author, the late US biochemist Oli- 
ver Lowry. “Although I really know it is not a 
great paper ... I secretly get a kick out of the 
response, he wrote in 1977. 

The colossal size of the scholarly literature 
means that the top-100 papers are extreme out- 
liers. Thomson Reuter’s Web of Science holds 
some 58 million items. If that corpus were scaled 
to Mount Kilimanjaro, then the 100 most-cited 
papers would represent just 1 centimetre at the 
peak. Only 14,499 papers — roughly a metre 
anda half’s worth — have more than 1,000 cita- 
tions (see “The paper mountain’). Meanwhile, 
the foothills comprise works that have been 
cited only once, if at all — a group that encom- 
passes roughly half of the items. 

Nobody fully understands what distin- 
guishes the sliver at the top from papers that 
are merely very well known — but research- 
ers customs explain some of it. Paul Wouters, 
director of the Centre for Science and Technol- 
ogy Studies in Leiden, the Netherlands, says 
that many methods papers “become a standard 
reference that one cites in order to make clear 
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to other scientists what kind of work one is 
doing”. Another common practice in science 
ensures that truly foundational discover- 
ies — Einstein’s special theory of relativity, 
for instance — get fewer citations than they 
might deserve: they are so important that they 
quickly enter the textbooks or are incorporated 
into the main text of papers as terms deemed 
so familiar that they do not need a citation. 

Citation counts are riddled with other con- 
founding factors. The volume of citations has 
increased, for example — yet older papers have 
had more time to accrue citations. Biologists 
tend to cite one another's work more frequently 
than, say, physicists. And not all fields produce 
the same number of publications. Modern bib- 
liometricians therefore recoil from methods as 
crude as simply counting citations when they 
want to measure a paper's value: instead, they 
prefer to compare counts for papers of similar 
age, and in comparable fields. 

Nor is Thomson Reuters’ list the only rank- 
ing system available. Google Scholar compiled 
its own top-100 list for Nature. It is based on 
many more citations because the search engine 
culls references from a much greater (although 
poorly characterized) literature base, including 
from a large range of books. In that list, avail- 
able at www.nature.com/top100, economics 
papers have more prominence. Google Schol- 
ar’s list also features books, which Thomson 
Reuters did not analyse. But among the science 
papers, many of the same titles show up. 

Yet even with all the caveats, the old-fash- 
ioned hall of fame still has value. If nothing 
else, it serves as a reminder of the nature of sci- 
entific knowledge. To make exciting advances, 
researchers rely on relatively unsung papers to 
describe experimental methods, databases and 
software. 

Here Nature tours some of the key methods 
that tens of thousands of citations have hoisted 
to the top of science’s Kilimanjaro — essential, 
but rarely thrust into the limelight. 


BIOLOGICAL TECHNIQUES 


For decades, the top-100 list has been domi- 
nated by protein biochemistry. The 1951 
paper’ describing the Lowry method for quan- 
tifying protein remains practically unreachable 
at number 1, even though many biochemists 
say that it and the competing Bradford assay’ 
— described by paper number 3 on the list — 
are a tad outdated. In between, at number 2, 
is Laemmli buffer*, which is used in a differ- 
ent kind of protein analysis. The dominance 
of these techniques is attributable to the high 
volume of citations in cell and molecular biol- 
ogy, where they remain indispensable tools. 
At least two of the biological techniques 
described by top-100 papers have resulted in 
Nobel prizes. Number 4 on the list describes 
the DNA-sequencing method’ that earned 
the late Frederick Sanger his share of the 
1980 Nobel Prize in Chemistry. Number 63 
describes polymerase chain reaction > 
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> (PCR)*,a method for copying segments of 
DNA that earned US biochemist Kary Mul- 
lis the prize in 1993. By helping scientists to 
explore and manipulate DNA, both methods 
have helped to drive a revolution in genetic 
research that continues to this day. 

Other methods have received less public 
acclaim, but are not without their rewards. In 
the 1980s, the Italian cancer geneticist Nico- 
letta Sacchi linked up with Polish molecular 
biologist Piotr Chomczynski in the United 
States to publish’ a fast, inexpensive way to 
extract RNA from a biological sample. As it 
became wildly popular — currently, it is num- 
ber 5 on the list — Chomczynski patented 
modifications on the technique and built a 
business out of selling the reagents. Now at 
the Roswell Park Cancer Institute in Buffalo, 
New York, Sacchi says that she received little 
in the way of monetary rewards, but takes sat- 
isfaction from seeing great discoveries built on 
her work. The technique played a part in the 
explosive growth in the study of short RNA 
molecules that do not code for protein, for 
example. “That is what I would consider, sci- 
entifically speaking, a great reward,’ she says. 


BIOINFORMATICS 


The rapid expansion of genetic sequencing 
since Sanger’s contribution has helped to boost 
the ranking of papers describing ways to ana- 
lyse the sequences. A prime example is BLAST 
(Basic Local Alignment Search Tool), which 
for two decades has been a household name 
for biologists wanting to work out what genes 
and proteins do. Users simply have to open the 
program in a web browser and plug ina DNA, 
RNA or protein sequence. Within seconds, they 
will be shown related sequences from thou- 
sands of organisms — along with information 
about the function of those sequences and even 
links to relevant literature. So popular is BLAST 
that versions*” of the program feature twice on 
the list, at spots 12 and 14. 

But owing to the vagaries of citation habits, 
BLAST has been bumped down the list by 
Clustal, a complementary programme for 
aligning multiple sequences at once. Clustal 
allows researchers to describe the evolutionary 
relationships between sequences from differ- 
ent organisms, to find matches among seem- 
ingly unrelated sequences and to predict how 
a change at a specific point in a gene or pro- 
tein might affect its function. A 1994 paper” 
describing ClustalW, a user-friendly version 
of the software, is currently number 10 on the 
list. A 1997 paper” on a later version called 
ClustalX is number 28. 

The team that developed ClustalW, at the 
European Molecular Biology Laboratory in 
Heidelberg, Germany, had created the pro- 
gram to work on a personal computer, rather 
than a mainframe. But the software was trans- 
formed when Julie Thompson, a computer sci- 
entist from the private sector, joined the lab in 
1991. “It was a program written by biologists; 


“ALTHOUGH | 
REALLY KNOW 
IT’S NOTA 
GREAT PAPER, 
| SECRETLY 
GET A KICK 
OUT OF THE 
RESPONSE.” 


I'm trying to find a nice way to say that,” says 
Thompson, who is now at the Institute of 
Genetics and Molecular and Cellular Biology 
in Strasbourg, France. Thompson rewrote the 
program to ready it for the volume and com- 
plexity of the genome data being generated at 
the time, while also making it easier to use. 

The teams behind BLAST and Clustal are 
competitive about the ranking of their papers. 
It is a friendly sort of competition, however, 
says Des Higgins, a biologist at University Col- 
lege Dublin, and a member of the Clustal team. 
“BLAST was a game-changer, and they've 
earned every citation that they get” 


Another field buoyed by the growth in genome 
sequencing is phylogenetics, the study of evo- 
lutionary relationships between species. 

Number 20 on the list is a paper’ that 
introduced the “neighbor-joining” method, 
a fast, efficient way of placing a large number 
of organisms into a phylogenetic tree accord- 
ing to some measure of evolutionary distance 
between them, such as genetic variation. It 
links related organisms together one pair at a 
time until a tree is resolved. Physical anthro- 
pologist Naruya Saitou helped to devise the 
technique when he joined Masatoshi Nei’s lab 
at the University of Texas in Houston in the 
1980s to work on human evolution and molec- 
ular genetics, two fields that were starting to 
burst at the seams with information. 

“We physical anthropologists were facing 
kind of the big data of that time,” says Saitou, 
now at Japan's National Institute of Genetics 
in Mishima. The technique made it possible 
to devise trees from large data sets without 
eating up computer resources. (And, in a nice 
cross-fertilization within the top-100, Clustal’s 
algorithms use the same strategy.) 

Number 41 on the list is a description” of 
how to apply statistics to phylogenies. In 1984, 
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evolutionary biologist Joe Felsenstein of the 
University of Washington in Seattle adapted a 
statistical tool known as the bootstrap to infer 
the accuracy of different parts of an evolution- 
ary tree. The bootstrap involves resampling data 
froma set many times over, then using the vari- 
ation in the resulting estimates to determine the 
confidence for individual branches. Although 
the paper was slow to amass citations, it rapidly 
grew in popularity in the 1990s and 2000s as 
molecular biologists recognized the need to 
attach such intervals to their predictions. 

Felsenstein says that the concept of the boot- 
strap”, devised in 1979 by Bradley Efron, a 
statistician at Stanford University in California, 
was much more fundamental than his work. 
But applying the method to a biological prob- 
lem means it is cited by a much larger pool of 
researchers. His high citation count is also a 
consequence of how busy he was at the time, 
he says: he crammed everything into one paper 
rather than publishing multiple papers on the 
topic, which might have diluted the number 
of citations each one received. “I was unable to 
go off and write four more papers on the same 
thing,” he says. “I was too swamped to do that, 
not too principled” 


Although the top-100 list has a rich seam of 
papers on statistics, says Stephen Stigler, a 
statistician at the University of Chicago in Illi- 
nois and an expert on the history of the field, 
“these papers are not at all those that have been 
most important to us statisticians”. Rather, they 
are the ones that have proved to be most use- 
ful to the vastly larger population of practising 
scientists. 

Much of this crossover success stems from 
the ever-expanding stream of data coming out 
of biomedical labs. For example, the most fre- 
quently cited statistics paper (number 11) isa 
1958 publication” by US statisticians Edward 
Kaplan and Paul Meier that helps researchers 
to find survival patterns for a population, such 
as participants in clinical trials. That intro- 
duced what is now known as the Kaplan-Meier 
estimate. The second (number 24) was Brit- 
ish statistician David Cox’s 1972 paper’® that 
expanded these survival analyses to include 
factors such as gender and age. 

The Kaplan-Meier paper was a sleeper hit, 
receiving almost no citations until comput- 
ing power boomed in the 1970s, making the 
methods accessible to non-specialists. Simplic- 
ity and ease of use also boosted the popular- 
ity of papers in this field. British statisticians 
Martin Bland and Douglas Altman made the 
list (number 29) with a technique’? — now 
known as the Bland-Altman plot — for visu- 
alizing how well two measurement methods 
agree. The same idea had been introduced by 
another statistician 14 years earlier, but Bland 
and Altman presented it in an accessible way 
that has won citations ever since. 

The oldest and youngest papers in 


the statistics group deal with the same 
problem — multiple comparisons of 
data — but from very different scientific 
milieux. US statistician David Duncan's 1955 
paper’® (number 64) is useful when a few 
groups need to be compared. But at number 
59, Israeli statisticians Yoav Benjamini and 
Yosef Hochberg’s 1995 paper”? on controlling 
the false-discovery rate is ideally suited for data 
coming from fields such as genomics or neuro- 
science imaging, in which comparisons num- 
ber in the hundreds of thousands —a scale that 
Duncan could hardly have imagined. As Efron 
observes: “The story is one of the computer 
slowly, then not so slowly, making its influence 
felt on statistical theory as well as on practice.” 


DENSITY FUNCTIONAL THEORY 


When theorists want to model a piece of matter 
— be it a drug molecule or a slab of metal — 
they often use software to calculate the behav- 
iour of the material’s electrons. From this 
knowledge flows an understanding of numer- 
ous other properties: a protein’s reactivity, for 
instance, or how easily Earth’s liquid iron outer 
core conducts heat. 

Most of this software is built on density 
functional theory (DFT), easily the most 
heavily cited concept in the physical sciences. 
Twelve papers on the top-100 list relate to it, 
including 2 of the top 10. At its heart, DFT 
is an approximation that makes impossible 
mathematics easy, says Feliciano Giustino, a 
materials physicist at the University of Oxford, 
UK. To study electronic behaviour in a silicon 
crystal by taking account of how every electron 
and every nucleus interacts with every other 
electron and nucleus, a researcher would need 
to analyse one sextillion (10”) terabytes of 
data, he says — far beyond the capacity of any 
conceivable computer. DFT reduces the data 
requirement to just a few hundred kilobytes, 
well within the capacity ofa standard laptop. 

Theoretical physicist Walter Kohn led the 
development of DFT half a century ago in 
papers”*”’ that now rank as numbers 34 and 
39. Kohn realized that he could calculate a sys- 
tem’s properties, such as its lowest energy state, 
by assuming that each electron reacts to all the 
others not as individuals, but as a smeared- 
out average. In principle, the mathematics 
are straightforward: the system behaves like 
a continuous fluid with a density that varies 
from point to point. Hence the theory’s name. 

But a few decades passed before research- 
ers found ways to implement the idea for real 
materials, says Giustino. Two??3 top-100 
papers are technical recipes on which the most 
popular DFT methods and software packages 
are built. One (number 8) is by Axel Becke, 
a theoretical chemist at Dalhousie Univer- 
sity in Halifax, Canada, and the other (num- 
ber 7) is by US-based theoretical chemists 
Chengteh Lee, Weitao Yang and Robert Parr. 
In 1992, computational chemist John Pople 
(who would share the 1998 Nobel prize with 


Kohn) included a form of DFT in his popular 
Gaussian software package. 

Software users probably cite the original 
theoretical papers even if they do not fully 
understand the theory, says Becke. “The the- 
ory, mathematics and computer software are 
specialized and are the concern of quantum 
physicists and chemists,” he says. “But the 
applications are endless. At a fundamental 
level, DFT can be used to describe all of chem- 
istry, biochemistry, biology, nanosystems and 
materials. Everything in our terrestrial world 
depends on the motions of electrons — there- 
fore, DFT literally underlies everything” 


CRYSTALLOGRAPHY 


George Sheldrick, a chemist at the University 
of Gottingen in Germany, began to write soft- 
ware to help solve crystal structures in the 
1970s. In those days, he says, “you couldn't get 
grant money for that kind of project. My job 
was to teach chemistry, and I wrote the pro- 
grams as a hobby in my spare time.’ But over 
40 years, his work gave rise to the regularly 
updated SHELX suite of computer programs, 
which has become one of the most popular 
tools for analysing the scattering patterns 
of X-rays that are shot through a crystal — 
thereby revealing the atomic structure. 

The extent of that popularity became 
apparent after 2008, when Sheldrick pub- 
lished a review paper™ about the history of 
the system, and noted that it might serve as 
a general literature citation whenever any 
of the SHELX programs were used. Readers 
followed his advice. In the past 6 years, that 
review paper has amassed almost 38,000 cita- 
tions, catapulting it to number 13 and making 
it the highest-ranked paper published in the 
past two decades. 

The top-100 list is scattered with other tools 
essential to crystallography and structural 
biology. These include papers describing the 
HKL suite” (number 23) for analysing X-ray 
diffraction data; the PROCHECK programs” 
(number 71) used to analyse whether a pro- 
posed protein structure seems geometrically 
normal or outlandish; and two programs””* 
used to sketch molecular structures (numbers 
82 and 95). These tools are the “bricks and 
mortar” for determining crystal structures, 
says Philip Bourne, associate director for data 
science at the US National Institutes of Health 
in Bethesda, Maryland. 

An unusual entry, appearing at number 22, 
is a 1976 paper” from Robert Shannon — a 
researcher at the giant chemical firm DuPont 
in Wilmington, Delaware, who compiled 
a comprehensive list of the radii of ions ina 
series of different materials. Robin Grimes, a 
materials scientist at Imperial College Lon- 
don, says that physicists, 
chemists and theorists 
still cite this paper when 
they look up values of 
ionic size, which often 
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correlate neatly with other properties of a 
substance. This has made it the highest for- 
mally-cited database of all time. 

“We often cite these kinds of papers almost 
without thinking about it,” says Paul Fossati, 
one of Grimes’s research colleagues. The same 
could be said for many of the methods and 
databases in the top 100. The list reveals just 
how powerfully research has been affected 
by computation and the analysis of large data 
sets. But it also serves as a reminder that the 
position of any particular methods paper or 
database at the top of the citation charts is also 
down to luck and circumstance. 

Still, there is one powerful lesson for 
researchers, notes Peter Moore, a chemist at 
Yale University in New Haven, Connecticut. “If 
citations are what you want,’ he says, “devising 
a method that makes it possible for people to 
do the experiments they want at all, or more 
easily, will get you a lot further than, say, dis- 
covering the secret of the Universe” = 


Richard Van Noorden is a reporter for 
Nature based in London, Brendan Maher 
is an editor for Nature based in New York, 
and Regina Nuzzo is a writer based in 
Washington DC. 
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QUESTIONS 


Scientists know a lot about the virus that causes Ebola — 
but there are many puzzles that they have yet to solve. 


BY ERIKA CHECK HAYDEN 


o much of the world, the virus behind the devastating Ebola 
outbreak in Africa seems to have stormed out of nowhere. 

But Leslie Lobel thinks we should have seen it coming. 
In 2012, Lobel and a team of researchers spent six 
months in Uganda studying the Ebola virus and related 
viruses. Over the course of their stay, these pathogens caused at least 
four separate outbreaks of disease in central Africa, affecting more than 
100 people. To Lobel, a virologist at Ben-Gurion University of the Negev 
in Beer-Sheva, Israel, the outbreaks felt like the small tremors that can 
precede a major earthquake. “We all said, something is going on here; 

something big is going to happen,” he says. 

Like Lobel, other scientists have predicted that these viruses would 
one day cause a major epidemic — and the current outbreak, which 
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has so far killed nearly 5,000 people, has proved —_ An Ebola virus 

them right. There are five species of closely _ particle from the 
related viruses that scientists refer to as ‘ebolavi- | 2014 outbreak. 
ruses’; the species behind the current outbreak, 

Zaire ebolavirus, is more generally known as ‘the Ebola virus. Along with 
Marburg virus and Lloviu virus, the ebolaviruses make up the filovi- 
ruses, a family that was unknown before the 1960s. All of the filoviruses 
share acommon structure, and most of them cause life-threatening 
haemorrhagic fevers in humans. 

Research on these once-ignored viruses took off after the 2001 
anthrax attacks in the United States, which prompted officials to sink 
money into investigating lethal pathogens that might be used in bio- 
terror attacks and to build dedicated laboratories where they can be 
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safely studied. Scientists have learned how these viruses work and have 
created the first experimental vaccines and therapies that might stop 
them. “The biodefence funding has been huge,” says microbiologist 
Thomas Geisbert of the University of Texas Medical Branch at Galveston, 
who has studied ebolaviruses for 26 years. 

But the advances in knowledge about filoviruses have also exposed 
the gaps. Scientists suspect that more members of the filovirus family 
remain to be discovered and in other parts of the world. They are also 
working to understand which animals naturally harbour filoviruses and 
why human filovirus outbreaks seem to be rising in frequency: they 
have occurred in 19 of the past 21 years, and three times this year alone. 
Finding answers is difficult because outbreaks are unpredictable and 
laboratory work with filoviruses requires the highest security measures. 

In the past few months, research has necessarily taken a back seat to 
efforts to control the Ebola outbreak — but as the epidemic escalates, 
science is coming to the fore. Researchers are recognizing that they 
might be able to stop this Ebola virus only if they understand its biology 
and how to control it. “We need a lot more information about the virol- 
ogy, the clinical presentation and the epidemiology of this virus,’ says 
Michael Osterholm, a public-health scientist at the University of Minne- 
sota’s Center for Infectious Disease Research and Policy in Minneapolis. 
“Nobody underestimates the difficulty of doing that research in these 
settings, but it is really important to get this information” With this 
in mind, Nature asked leading researchers to discuss the most urgent 
scientific questions about the Ebola virus and other filoviruses — the 
questions that, ifanswered, might prevent another disastrous outbreak, 
or help to contain the current one. This is what they said. 


WHERE DO FILOVIRUSES COME FROM? 


In July 2007, a miner who had been prospecting for lead and gold in 
a Ugandan cave became infected with Marburg virus. Officials closed 
the cave, and a team of researchers led by the US Centers for Disease 
Control and Prevention (CDC) swept in to investigate. They hoped to 
answer a decades-old question: what animal is the natural host for filo- 
viruses? The mystery had lingered since 1967, when Marburg virus — the 
first filovirus to be discovered — sickened European lab workers who 
had handled imported monkeys. The high lethality of filovirus infec- 
tion in monkeys, humans and other apes suggested that primates were 
not the natural hosts: ifa virus kills too many of its hosts, then it cannot 
propagate and dies out. There were clues that bats might be the ‘reservoir’ 
species. But to prove it, scientists needed to find an infected bat. 

The researchers captured some 1,300 bats roosting in the cave and 
tested their blood for Marburg virus’. They finally found what they were 
looking for: infectious Marburg viruses isolated from five Egyptian fruit 
bats, none of which showed symptoms of disease. The team also found 
more infected bats in a nearby cave that had been linked to a previous 
Marburg virus outbreak. 

It is not entirely clear how the virus is transmitted from bats to people, 
although the most likely route is through contact with bodily fluids. Bats 
infected with Marburg virus in the lab shed the virus in their mouths, 
so wild bats might spread it by leaving traces on fruit that is later eaten 
by other animals”. Knowing the host species for the other filoviruses 
is crucial. “Until we understand what that reservoir is, it is difficult to 
limit your encroachment on that species,’ says virologist John Dye of 
the US Army Medical Research Institute of Infectious Diseases in Fort 
Detrick, Maryland. 

Researchers now strongly suspect that bats are the natural host for 
ebolaviruses, too. In 1976, during one of the first known Ebola outbreaks, 
the six people who were initially infected worked in a factory room in 
Sudan that was home to roosting bats’. Researchers have since isolated 
antibodies to ebolaviruses from bats, as well as snippets of genetic 
material from the viruses. But proving that bats are the reservoir has 
been maddeningly difficult — no one has isolated an infectious ebola- 
virus from a wild bat, and it has been difficult to trace rare and sporadic 
outbreaks back to a source. The ebolavirus outbreaks have originated in 
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many locations, only sometimes among people or animals who have had 
contact with bats. “You're pretty much looking at the entire tropical for- 
est,’ says Jonathan Towner, a molecular virologist at the CDC in Atlanta, 
Georgia, who trapped bats in Uganda for the Marburg investigations. 

The current outbreak is thought to have begun in southeastern 
Guinea in December 2013, when a two-year-old boy died of a mys- 
terious illness that quickly spread to family members and health-care 
workers. So far, the response to the outbreak has been much more 
focused on containing it than tracing its source. “The public-health 
crisis is unprecedented,” Towner says. “There’s no room for an ecology 
investigation at this point.’ 


HOW WIDESPREAD ARE FILOVIRUSES? 


Filoviruses are not just found in bats and primates. That became clear in 
2008, when Philippine officials called for help to investigate an outbreak 
of disease in pigs. When researchers arrived, they found that the pigs 
were infected with Reston ebolavirus* — a species that was first discov- 
ered in monkeys imported to the United States from the Philippines in 
1989. The pig discovery was a shock because until then, no ebolavirus 
had been known to naturally infect a farm animal. And it was nota freak 
event: Reston ebolavirus was also reported’ in pigs in China in 2012. 
However, the Reston virus seems to be relatively harmless to humans. 
People working on the Philippine pig farms harboured antibodies to 
it — a sign that the pigs had transmitted the virus to them — but no 
humans fell ill. 

In 2011, scientists confirmed that pigs can also become infected with 
Zaire ebolavirus®. The concern now is that pigs could serve as a mixing 
vessel for filoviruses. Pigs could be simultaneously infected with several 
filoviruses, which might exchange genetic material to eventually pro- 
duce new versions that are capable of sickening humans. “The relevant 
practical question is, do we need to worry about Reston? Ifit is truly not 
pathogenic in humans, is there a potential that it could change?” says 
Erica Ollmann Saphire, a structural biologist who studies filoviruses at 
the Scripps Research Institute in La Jolla, California. 


‘We all said, something is 
soing on here; something 
bigis going to happen.” 


Scientists are probably just beginning to understand the different types 
of filoviruses and their geographic reach. The list of known filoviruses 
only recently expanded: the fifth ebolavirus (Bundibugyo ebolavirus) 
was discovered in Uganda in 2007, and the Lloviu virus was identified 
from dead bats in Spain in 2011. “We may find more members of the 
filovirus family elsewhere in the world,’ says virologist Ayato Takada 
of Hokkaido University in Sapporo, Japan. The fact that the viruses are 
more common than was once thought suggests that they have been 
around for a very long time — perhaps for most of human history. And 
it could be that scientists are detecting only a small fraction of events 
in which they cross over from animals to humans. Researchers are now 
trying to work out how frequently they jump into people and, when 
they do, how often they cause disease. In 2010, one team reported that 
as many as 20% of people in some areas of Gabon carry antibodies to 
Zaire ebolavirus in their blood, indicating that they were exposed to the 
virus in the past without becoming ill’, 

Lobel says that these data must be taken with a pinch of salt, and 
that the assays could actually be detecting immune responses to viruses 
similar to ebolaviruses. In his studies, people known to have survived 
definite Ebola virus infections had a different immune response from 
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KNOW YOUR ENEMY 


Regions at risk 

By mapping risk factors, including 
the location of fruit bats that might 
harbour ebolaviruses, scientists 
estimated the areas in Africa most 
at risk of an outbreak. 


@ High risk 
~ Medium risk 
Low risk 


Killers in the family 
The filovirus family includes Marburg virus, Lloviu virus and five 
species of ebolavirus. Some family members have been traced 
to fruit bats, from which they may pass to humans. 


Marburg virus 


Lloviu virus 


Ebola virus 


(Zaire ebolavirus) 


Bundibugyo virus 
(Bundibugyo ebolavirus) 


‘ Tai Forest virus 
Epolavinuises (Tai Forest ebolavirus) 
Sudan virus 

(Sudan ebolavirus) 


Reston virus 
(Reston ebolavirus) 


those who have never been infected. He and other researchers are now 
surveying immune responses among more filovirus survivors. 

A pressing question in the present outbreak is how the currently 
circulating Zaire ebolavirus might be changing. Its rapid spread hints 
that there could be something different about this strain — possibly that 
it has become easier to pass from one person to another. “We have to 
start investigating whether the 2014 strain behaves like previous out- 
break strains, or whether it could potentially be more transmissible,” 
says Kristian Andersen, a virologist at the Broad Institute in Cambridge, 
Massachusetts. 

Andersen is careful to note that there are no data to suggest this yet. 
Although a handful of researchers have raised fears that the virus might 
mutate into an airborne form, most think it is unlikely that the virus 
could change so dramatically in such a short time. Genetic analyses 
have shown that this strain of Zaire ebolavirus has mutated hundreds of 
times since it diverged from an ancestral virus about ten years ago’ (see 
Nature http://doi.org/vsd; 2014), but no one yet knows whether any of 
these mutations have altered important properties of the virus. Instead, 
its unprecedented spread is thought to be due to the fact that it emerged 
in an area of Africa where people were unfamiliar with the virus and 
how to control it, allowing it to escape to urban centres. Researchers are 
now carrying out more-detailed studies of the virus to better understand 
its origins and characteristics. 


ARE WE MAKING EBOLA OUR OWN WORST ENEMY? 


In September, epidemiologists published an analysis in which they 
mapped the locations of all ebolavirus outbreaks in Africa along with the 
known ranges of three bat species that are candidate reservoirs for the 
viruses’. They also plotted changes in African populations and mobil- 
ity — for instance, the proportion of each country that lives in rural 
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Sierra Leone, Guinea and Liberia — the 
countries at the epicentre of the current 
outbreak — are at high risk because 
many people live in areas inhabited by 
bats that might carry ebolaviruses. 


Before this year, all but one of the 

previous human Ebola outbreaks 

started in one of five central African 

countries: Republic of Congo, | 
South Sudan, Democratic Republic 

of Congo, Uganda and Gabon. 
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and urban areas. The team wanted to pinpoint the areas that might be 
at highest risk of future outbreaks. 

Before this year, all but one of the human ebolavirus outbreaks could 
be traced back to central Africa, and the Zaire ebolavirus species had 
never been seen in West Africa. But “we shouldn't have been massively 
surprised that it was there’, says epidemiologist Simon Hay of the 
University of Oxford, UK, who led the analysis. Even when he and his 
team did not include data from the current outbreak, they predicted 
that the three hardest-hit countries — Sierra Leone, Guinea and Libe- 
ria — would be at high risk for an ebolavirus outbreak because they 
have large numbers of people living in areas inhabited by the bats. 
In all, the analysis highlights 22 African countries where ebolavirus 
outbreaks are likely to start, putting 22 million people at risk (see 
“Know your enemy’). 

The study also begins to explain why filovirus outbreaks seem to be 
growing more frequent, widespread and larger in size. Human popu- 
lations in countries that are likely to harbour filoviruses have nearly 
tripled since the viruses were first discovered, and flight traffic has 
increased by one-third since 2005. The viruses are not coming to us; 
instead, we are encroaching on the viruses, as population growth and 
increasing travel put humans in contact with viral hosts, and then people 
unwittingly transport the viruses around the world. “There’s this percep- 
tion that these outbreaks occur in completely isolated, remote parts of 
Africa where they burn out before they hit major population centres,’ 
Hay says. “Obviously we've seen that this is not the case with the lat- 

est outbreak.” His team has released its data’’ and 
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are most likely to occur. 
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WHY IS EBOLA SO LETHAL? 


Ebola virus is one of the most lethal viruses known. In the current 
outbreak, an estimated 60-70% of those infected have died, and in 
previous outbreaks the figure has reached almost 90%. (Only rabies, 
smallpox and a handful of other viruses are as fatal if left untreated.) 
The reason that Ebola virus and other filoviruses are so lethal is because 
they turn the body’s own defences against it. 

Normally when a virus invades the body, it triggers cells in the ‘innate’ 
arm of the immune system, which cause inflammation and other reac- 
tions to fight off the infection. The Ebola virus, however, infects and 
cripples innate immune cells, taking out this first line of defence. These 
dying cells also trigger a destructive flood of chemicals, called a cytokine 
storm, and cause the downstream death of cells that normally make 
protective antibodies. 

Other highly pathogenic viruses also trigger cytokine storms, but 
filoviruses are thought to be particularly lethal because they affect a wide 
array of tissues. As well as the immune system, Ebola virus attacks the 
spleen and kidneys, where it destroys cells that help the body to regulate 
its fluid and chemical balance and that make proteins that help the blood 
to clot. At its worst, Ebola virus causes the liver, lungs and kidneys to 
shut down, other organs to fail and the blood vessels to leak fluid into 
surrounding tissues. Too often, this ends in death. 

If scientists can understand how the immune systems of survivors 
were able to fight off the virus, they might be able to encourage this form 
of defence through a vaccine. Researchers have found that survivors of 
previous outbreaks managed to make antibodies to Ebola virus, avoid 
the cytokine storm and preserve their immune cells during the course 
of infection. But why they can do this, and others cannot, is a mystery. 
“The question in our mind is, how did they survive?” Lobel says. 

The right treatment can raise the chance of survival. In the current 
outbreak, people infected with Ebola virus who are treated in devel- 
oped countries have been more likely to survive than patients treated 
in Africa, because they receive much more intensive care. There is no 
targeted treatment for filoviruses, but doctors can closely monitor and 
correct blood chemistry and protein imbalances caused by organ failure 
and fluid loss — with intravenous drips or kidney dialysis, for example. 
“Right now, the Ebola virus has such a high lethality because of the level 
of care provided in the places where it is occurring,” says Dye. “Ifhealth 
care gets better, the lethality rates will decrease” 

Sadly, Osterholm says, even the most basic interventions that could 
help people infected with Ebola virus are not being provided right now 
in the hardest-hit areas. For instance, oral rehydration therapy is being 
widely used in place of intravenous fluid replacement because of the fear 
that health-care workers could be infected in the process of inserting 
an intravenous line. Osterholm says that it is urgent to ask how these 
choices are affecting patients: “In this setting, what treatments are work- 
ing? Is what we're doing having any clinical impact?” 


CAN THE VIRUS BE STOPPED? 


Dozens of previous filovirus outbreaks have been halted using the same 
basic tools: isolating and treating patients, and tracing and monitoring 
their contacts. Public-health officials have also used this method to con- 
tain the spread of Ebola virus in Nigeria and Senegal. But across West 
Africa as a whole, the public-health response was completely inadequate 
at the start of the outbreak, which has allowed the virus to spread rapidly. 

If, as some epidemiologists predict", the disease infects tens or even 
hundreds of thousands of people by January 2015, then it could become 
nearly impossible for tried-and-tested public containment measures to 
bring the epidemic under control. For one thing, it could be unfeasible 
to recruit and train the numbers of medical staff required. At this stage, 
some experts say that a new plan is needed. 

Aid agencies and non-profit organizations are already trying new 
types of control. In Sierra Leone, officials are building isolation centres 
that can house patients away from their family and community, to 
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prevent the virus from spreading, but that have fewer qualified health- 
care workers than a standard treatment facility. It is a controversial 
move, because it risks sending the message that patients are being 
warehoused to die. But it acknowledges the harsh reality that existing 
clinics are full, patients are being turned away, and that this is fuelling 
the disease spread. “Everyone realizes these smaller-level community 
facilities are not ideal, but it is trying to do something rather than noth- 
ing to try to bring transmission down in communities,’ Hay says. 

Another new approach in this epidemic has been the use of experi- 
mental therapies and vaccines developed specifically to target Ebola 
virus. Much attention has focused on ZMapp — a cocktail of antibodies 
that was first identified using mice vaccinated with Ebola virus proteins, 
and which has been given to several people in this outbreak. In August, 
researchers reported that ZMapp protected 18 monkeys from dying of 
Ebola virus — the first report of a highly successful therapy in animals 
already showing symptoms of the disease’*. Researchers are now organ- 
izing further tests of this and other medicines, as well as a handful of 
experimental vaccines. 


“The question in our mindis, 
how did they survive?” 


But even if these therapies are proven to work, the challenge is hardly 
over. The products that have progressed furthest in development are 
all focused on Zaire ebolavirus, the most lethal of the four species that 
sicken humans, but it is unlikely that they will be as effective against 
other filoviruses. Saphire leads an international consortium that is 
systematically testing combinations of antibodies to find the ones that 
work best against different filovirus types. Researchers hope one day 
to produce therapies that work against multiple filoviruses and that 
could be used immediately when symptoms are detected, rather than 
spending precious time diagnosing exactly which virus is making 
someone ill. 

All this work has taken on new urgency as predictions about the 
African outbreak have grown more dire. This year, the US National 
Institute of Allergy and Infectious Diseases, the Wellcome Trust, the 
European Union and various drug companies have channelled emer- 
gency funding into research on potential drugs and vaccines. And 
on 3 November, the US Institute of Medicine is convening a meet- 
ing in Washington DC to lay out an agenda for research on Ebola 
virus. That meeting takes place against growing worries that the virus 
might become endemic in parts of Africa, continuing to sicken and kill 
people for several years. 

Researchers understand that the public-health response to this out- 
break is a top priority, but they are also becoming more determined to 
understand this mysterious family of killer viruses. “What’s happening 
in West Africa is a desperate situation,’ Saphire says. “It makes people 
very willing to figure out what they need to do to solve problems.” m 


Erika Check Hayden is a reporter for Nature in San Francisco, California. 
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At least one roughly 10-metre-wide asteroid passes as close as the Moon each week (artist’s impression). 


Find asteroids to 
get to Mars 


Asteroid retrieval is a distraction, says Richard P. Binzel. 
Better steps to interplanetary travel abound. 


leap for humans in space. Yet consen- 
sus on even the smallest steps forward 
has proven elusive. In June, a US National 
Research Council report’ illuminated many 
options but offered no recommendations. 


[icrioria flight is the next giant 


Return to the Moon? Head straight to 
Mars? Pluck a boulder off an asteroid and 
tug it to lunar orbit, just so that idle astro- 
nauts have somewhere to go and something 
to do? NASA must decide which path to 
follow before President Barack Obama’s 


budget announcement in January 2015. 

Some options are better than others. The 
cost and complexity of human space explora- 
tion demands that each element be measured 
by its value towards the ultimate goal: Mars. 

But NASAs stated next priority will not 
contribute to that aim. Its Asteroid Redirect 
Mission (ARM)? is a multibillion-dollar 
stunt to retrieve part of an asteroid and 
bring it close to Earth where astronauts can 
reach it. It will require an ancillary space- 
craft deploying either a huge capture bag or 
a Rube Goldberg contraption resembling a 
giant arcade-game claw. Neither technology 
is useful for getting humans to Mars. 

There is a better way. Thousands of ship- 
ping-container-sized and larger asteroids 
pass almost as close as the Moon each year. 
Many, such as September's near-miss asteroid 
2014 RC, come closer. We need to find them 
far enough in advance, and abundant oppor- 
tunities for crewed missions will open up. 

This gateway for human space exploration 
requires three things: a thorough asteroid 
survey to find thousands of nearby bodies 
suitable for astronauts to visit; extending 
flight duration and distance capability to ever- 
increasing ranges out to Mars; and developing 
better robotic vehicles and tools to enable 
astronauts to explore an asteroid regardless 
of its size, shape or spin. The asteroid survey 
would also provide a prudent and overdue 
assessment of future impact hazards. 


PASSERS BY 

Asteroids orbit the Sun, most of them in 
the asteroid belt between Mars and Jupi- 
ter. Jupiter's gravity and other forces nudge 
a few onto paths that come within 40 mil- 
lion kilometres of Earth. The largest of 
these near-Earth asteroids (NEAs) is about 
30 kilometres across, although most meas- 
ure in metres. The smallest objects are most 
numerous and strike Earth frequently. 
Tonnes of their residual grains and peb- 
bles rain down from space every day; a few 
metre-sized objects hit each year. 

At about 20 metres across, the asteroid 
that lit the skies and shattered windows in 
Chelyabinsk, Russia, in 2013 demonstrated 
athreshold between merely delivering mete- 
orites and posing a significant hazard. A 
Chelyabinsk-like airburst occurs somewhere 
on Earth on average every 50 years, usually 
over the oceans. A 10-kilometre-wide 
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> impactor such as the one that hit Earth at 
the end of the Cretaceous period is, thank- 
fully, a one-in-hundred-million year event’. 
But asteroids can be friends, not foes. For 
nearly four decades*, NEAs have been recog- 
nized as human spaceflight destinations that 
are more accessible than the lunar surface. 
Owing to their minuscule gravity fields, a 
rendezvous with an asteroid merely entails 
sidling up and flying alongside one, with no 
need for a specialized landing craft. 
Asteroids whose orbits pass between Earth 
and Mars offer a range of milestone desti- 
nations for testing distance and duration 
capabilities of human spaceflight. The first 
missions might last weeks and not go far; 
later excursions could last months and ven- 
ture successively farther into interplanetary 
space, proving that Mars is within our grasp. 
A 2009 presidential task force’ recognized 
NEAs as a low-cost and achievable option 
along such a ‘flexible path’ to Mars. Obama 
set course gently by mentioning the word 
‘asteroid’ and the year ‘2025’ in a speech to 
a receptive crowd of space workers in April 
2010. Although barely a murmur compared 
with President John F. Kennedy’s exhorta- 
tion in May 1961 to go to the Moon, NASA 
planners took Obama's words as a directive. 
Reality soon set in: getting humans to 
any known near-Earth asteroid by the mid- 
2020s was deemed beyond the range and 
budget of emerging flight systems. So the 
ARM scheme was hatched: instead of send- 
ing humans to an asteroid, let’s bring a piece 
of one to within the anticipated reach of a 
2025 crew. An uncrewed solar-electric pro- 
pulsion vehicle would capture the booty and 
tug it back to a lunar orbit, where astronauts 
launched separately could explore it. 
Mission accomplished? Not at all, in my 


MISSION REQUIREMENTS 


view. Hardware and operations to capture, 
contain and redirect an asteroid are dead- 
end elements with no value for long-dura- 
tion crewed space travel’. Delivering a 
supply module to lunar space would be a 
more sensible way to demonstrate solar-elec- 
tric propulsion and benefit astronaut safety. 

Conveying to the 


“Whyretrieve public that reaching 
an asteroid Mars requires patient 
when we can and diligent progres- 
wait foroneto  sionin capabilities is 
come nearus?” _ thehonest alternative 


to distracting them 
with a one-off costly stunt. 

Equally specious in my opinion are argu- 
ments that ARM can deliver important 
new information on asteroid hazards or 
space resources. The ARM target would 
be scarcely one-quarter the size and one- 
hundredth the mass of the Chelyabinsk body 
— too small to survive atmospheric passage. 
And the idea that we might benefit any time 
soon by extracting water or rocket fuel out 
of an asteroid is fantastical, owing to its cost 
and complexity. 


EXPLORATION GATEWAY 

NASA needs a more pragmatic plan. 
Rededication to Mars as the long-term goal 
is clearly necessary (independent of any 
decision regarding a return to the Moon). 
Near-Earth asteroids are the most accessible 
interplanetary stepping stones to Mars. Why 
retrieve an asteroid when we can wait for one 
to come near us? 

There are plenty to choose from (see 
‘Mission requirements’). The roughly one 
thousand objects of tens of metres and larger® 
currently known to pass within a few times 
the distance to the Moon are just the tip of the 


A mission to a near-Earth asteroid would require less propulsion and a shorter mission duration 
than a human mission to Mars. Less than 1% of the most-accessible asteroids are currently known 
(yellow circles), but a dedicated survey (filling in the yellow-hatched region) would reveal abundant 
asteroid stepping-stone opportunities as a gateway for human interplanetary exploration. 
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iceberg (see go.nature.com/9db89t). Nearly 
10 million more such objects await discovery 
in orbits passing between Earth and Mars; 
one or more fly inside the Moon’s orbit every 
week. So far, we have detected scarcely 0.1% 
of them because our current asteroid survey 
network is not up to the task. 

The asteroids that are optimally located for 
exploration are also among those posing a 
potential impact hazard to Earth. ‘Find them 
before they find us’ has been the mantra of 
planetary scientists*’ and national reports*” 
for decades. Yet neither the US Congress nor 
NASA has funded a large, dedicated tele- 
scopic survey. NASA spends only a minuscule 
fraction of its budget — $US20 million per 
year — on searches using a haphazard array 
of adapted astronomical facilities. 

Current surveys also fall short of US law. 
The George E. Brown, Jr. Near-Earth Object 
Survey Act of 2005 requires NASA to find 
90% of hazardous asteroids 140 metres or 
larger by 2020, a goal that will not be met 
without significantly more investment. It is 
a grand challenge requiring a grand com- 
mitment from the space agency, the presi- 
dent and Congress: $200 million per year. 
This budget line, comparable to NASA's 
New Frontiers programme that has sent 
uncrewed probes to the Solar System’s plan- 
ets, would allow the survey to be completed 
within a decade at a fraction of ARM’s pro- 
jected multibillion-dollar cost. 

Once humans can reach one asteroid in its 
native orbit, the gateway is opened such that 
hundreds (if not thousands) more will be 
accessible, enabling a steady programme of 
exploration to be unrolled in the late 2020s 
and 2030s. Commercial development of 
space resources on select candidates could 
begin to be evaluated towards the middle of 
this century. 

Robotic operated vehicles (ROVs) will 
need to be developed to explore asteroids, 
just as astronauts in orbit around or on 
the surface of Mars will need to command 
robotic workhorses. Most asteroids are irreg- 
ularly shaped, boulder-strewn and tumble in 
their orbits. We have no reason to expect an 
accessible asteroid to have a slow and steady 
spin and safe, smooth surface hospitable to 
a space suit. ROVs such as those used for 
underwater exploration could be directed 
from the main crew module to carry out 
in situ investigations. With an array of arms 
or grappling devices to gain a toehold nec- 
essary for working in microgravity, and the 
capability to operate longer hours than pos- 
sible for a spacewalk, exploration of these 
worlds could be achieved. 


GRAND CHALLENGE 

Over the next two months, Obama's 2015 
budget will be shaped. NASA needs to make 
a clear choice about its priorities. It should 
abandon the ARM mission concept and 
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make an asteroid survey its top priority to 
provide a basis for future crewed missions. 

NASAs directorates for human explora- 
tion, space technology, science and others 
must pool resources to address the agen- 
cy’s declared grand challenge. Congress 
and the White House should fund a new 
series of missions that embrace the result- 
ing synergy of exploration, technology and 
science for the benefit of humanity. 

Such a ‘Grand Challenge Mission’ pro- 
gramme should support human space- 
flight and humankind’s future. It should 
be modelled after, and with a budget sim- 
ilar to, the New Frontiers programme of 
planetary probes each costing less than 
$800 million and selected through com- 
petition. Advance definition of objectives 
would drive development by industry 
and academia towards the best ideas and 
optimize long-term planning. Competi- 
tive selection of proposals ensures the 
most cost-effective return on taxpayer 
investment. 

Three asteroid-related concepts should 
be explored: an asteroid survey to find a 
series of human destinations on the path 
to Mars while fulfilling the requirements 
of the 2005 survey act; a competition to 
test robotic asteroid-deflection methods 
on which civilization’s survival could one 
day depend; and another to test robotic 
methods of extracting water or mining 
other valuable resources from asteroids 
that might contribute towards sustaining 
human spacefaring decades from now. 

NASA needs to get back on track 
towards achieving humankind’s next 
giant leap in space. = 
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Is your most cited 


work your best? 


John P. A. Ioannidis and colleagues asked the most 
highly cited biomedical scientists to score their 
top-ten papers in six ways. 


fter half a century of citation indices, 
A= intriguing questions remain. 
Are the most highly cited papers 
the most important ones? Does science 
make progress mostly through evolution or 
through revolution? Are these two processes 
mutually exclusive or complementary, and 
which do high citations most reflect? Are 
surprising findings difficult to publish? 
Highly cited papers are nodes in the 
network of the dissemination and discus- 
sion of scientific information. But citation 
counts alone cannot reveal why a paper is 
considered so important as to attract repeated 
mention by other researchers. To contribute 
to these debates, we surveyed the most-cited 
authors of biomedical research for their views 


on their own influential published work’. 

We got some intriguing feedback. The 
vast majority of this elite group felt that 
their most important paper was indeed one 
of their most-cited ones. Yet they described 
most of their chart-topping work as evolu- 
tionary, not revolutionary. 


BEST OF THE BEST 
We listed the 400 most-cited biomedical 
scientists in the period 1996-2011*. We 
selected each author’s ten most-cited papers 
(adjusted for publication year) published in 
2005-08, and asked them to score the papers 
(ona scale of 0 to 100) in six dimensions. 
We restricted the period to 2005-08, 
because the perception of the importance 
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> ofa paper can change over time’. Old, 
highly cited papers might become stereo- 
typed®: people unquestioningly treat them 
as canonical. Recent papers (those published 
within the past five years), have had insuf- 
ficient time to accrue citations. 

Overall, 123 scientists responded, scoring 
1,214 papers between them. On average, 
investigators tended to give their block- 
buster papers high scores for dimensions 
that reflect evolution: Continuous Progress, 
Broader Interest and Greater Synthesis (for 
definitions of these terms and extended data, 
see Supplementary Information; go.nature. 
com/76fqq2). They gave the papers lower 
scores on average for dimensions that reflect 
revolution: Disruptive Innovativeness and 
Surprise (see ‘Self assessment’). They also 
indicated that blockbuster papers were easy 
to publish, with some exceptions. 

Twenty scientists (16%) felt that their most 
important paper published in 2005-08 was 
not among their top ten most cited. However, 
most of these 20 papers were still heavily cited 
(on average in the top 3% published in the 
same year in terms of citations; seven were in 
the top 15 papers that the author published in 
2005-08). Authors scored these papers higher 
for Disruptive Innovativeness (in nine cases) 
and Surprise (in five cases) than their ten 
most-cited papers. 

Fifty-two papers were appraised by at 
least two authors. We evaluated co-author 
agreement by comparing the scores for each 
dimension to their median values for that 
author. We considered the paired responses 
as an example of an agreement if both 
authors scored a paper above their median, 
below their median or at their median. The 
expected proportion of agreement given 
random responses is 50%. The rate of agree- 
ment ranged from 74% to 86% for the six 
dimensions, but the sample size was limited. 


SURPRISE, SURPRISE 

Predictably, the strongest correlations were 
between Disruptive Innovativeness and 
Surprise, and between Surprise and Publica- 
tion Difficulty. We had expected that papers 
would be perceived as either evolutionary 
or revolutionary, not both. A skew towards 
evolution or revolution would have shown 
up as a strong negative correlation between 
Disruptive Innovativeness and Continuous 
Progress, but this relationship was not statis- 
tically significant. Instead, Broader Interest 
scores correlated with strong scores for both 
revolution and evolution. 

Our survey, of course, has limitations. 
First, only just over 30% of the contacted 
authors replied. The non-responders might 
have given higher scores for revolution or 
evolution. Second, we assessed papers 
published only in a narrow time frame, 
in an effort to achieve uniformity. Third, 
authors may appraise their own work 


SELF ASSESSMENT 


One hundred and twenty-three of the most highly cited biomedical 
scientists scored their ten most-cited papers (1,214 between them) 


published in 2005-08 in six dimensions. 
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more positively than work by others. 

A fourth caveat is that the sample of the 
most-cited scientists is biased in favour of 
individuals whose work has already been 
widely accepted. Those with paradigm- 
changing ideas that are not so accepted 
(and thus cited) would not be in this group. 
Finally, scientists whose citations are closer 
to the average may have scored these highly 
cited papers or their own middle-ranking 
papers quite differently. 

We suspect that among more averagely 
cited work, evolution scores would have 
been even more prominent. True out-of- 
the-box innovation and major surprises are 
probably not common across the literature. 


NOW WHAT? 

This small survey is a salutary reminder 
both of how much information citations can 
convey, and of how much about the science 
of science we have yet to understand. For 
example, given that revolutionary papers 
are quite rare, how might we identify them 
early? Do innovative papers make connec- 
tions between areas of knowledge that are 
not typically made’, or do they get cited early 
on by papers in remote fields? 

In the future, with all the low-hanging fruit 
plucked, will we see (among the most-cited 
papers) a relative drop in the percentage of 
revolutionary papers and a corresponding 
increase in papers that provide a synthesis of 
the literature? How quickly does stereotyping 
of opinion about the importance of a paper 
happen, and how quickly and how much do 
such opinions change over time? What pro- 
portion of the most important papers across 
each of these six dimensions might be found 
among the output of the large majority of sci- 
entists with more-average citation profiles? 

It would be particularly useful to know 
whether successful out-of-the-box ideas are 
generated and defended largely by the most 
influential scientists or by colleagues lower 
on the citation rankings. Would the opinion 
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of scientists who cited the top papers that we 
examined square with the opinions sum- 
marized here? Are there other dimensions 
in addition to the six that we examined 
that might capture the essence of impor- 
tant work? For example, some respondents 
pointed to the significance of translational 
potential and social impact for research, 
which might have been captured only in part 
under our Broader Interest dimension. And 
it would be interesting to know whether there 
are major differences in the evolution-versus- 
revolution pattern in the physical sciences. 
One way to answer some of these questions 
would be to survey those who cite the highly 
cited papers or investigators with more- 
modest citation rankings. We must continue 
to hone indices other than citation-based 
metrics to complement appraisal of scientific 
accomplishment’. m SEE NEWS FEATURE P.550 
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A nineteenth-century watercolour showing the effects of lupus, an autoimmune disease. 


AUTOIMMUNITY 


When the body betrays 


Tilli Tansey surveys a magisterial, historically rich biography of autoimmunity. 


terrifying incident opens Intolerant 
At In 1981, novelist Joseph 

Heller became paralysed within 
a few hours by the autoimmune disease 
Guillain-Barré syndrome. In this horrific 
physiological own goal echoing the catch-22 
of his classic book, Heller’s immune system 
turned on his peripheral nervous system. 

It has been calculated that in any popula- 
tion, 5-10% of people — a large proportion 
of them women — will at some point have 
an autoimmune disease. Intolerant Bodies, 
by the medically trained historian Warwick 
Anderson and clinical immunologist Ian 
Mackay, draws on medical, scientific, his- 
torical and literary sources to highlight four 
such conditions: type 1 diabetes, rheumatoid 
arthritis, systemic lupus erythematosus and 
multiple sclerosis. 

The term autoimmune first appeared in 
1951, and autoimmunity in 1957, although 
neither was then — or now — universally 
accepted. Some immunologists continue to 
prefer ‘autoallergy’; the late Robin Coombs, 
inventor of the first specific test for red-cell 


antibodies, declared 
that he would “go 
down fighting” in his 
opposition to ‘auto- 
immune’. 

Mackay and Ander- 
son focus less on the 
heyday of autoim- 
munity in the past few 
decades than on its 
‘prehistory’ and emer- 
gence asa defined field 
of medical research 
and practice. Early- 
nineteenth-century 
studies of fevers led to 
the recognition that 
the body’s own physiological responses could 
be harmful, and germ theory later that cen- 
tury spurred knowledge of the body’s defence 
mechanisms. Medical luminaries such as 
Robert Koch and Joseph Lister struggled 
to understand basic mechanisms and iso- 
late causative agents. In the 1880s, the Rus- 
sian biologist Elie Metchnikoff argued that 


Intolerant Bodies: 
A Short History of 
Autoimmunity 
WARWICK ANDERSON 
AND IAN R. MACKAY 
Johns Hopkins 
University Press: 2014. 


564 | NATURE | VOL 514 | 30 OCTOBER 2014 


© 2014 Macmillan Publishers Limited. All rights reserved 


circulating cells, the phagocytes, digested 
invading cells. In the first half of the twen- 
tieth century, in public-health labs and early 
pharmaceutical companies, the development 
of serotherapy — treating infectious diseases 
such as diphtheria by injecting serum raised 
in an immunized animal — stimulated much 
research on the immune system. 

The authors emphasize how after the 
Second World War, an increase in govern- 
ment and private funding led to the expan- 
sion of research laboratories, including Henry 
Kunkel’s at the Rockefeller Institute in New 
York City, Macfarlane Burnet’s at the Walter 
and Eliza Hall Institute in Melbourne, Aus- 
tralia, and those at industrial concerns such 
as pharmaceutical giant Merck. Researchers 
developed tools, techniques and languages 
based on cellular and molecular approaches. 
In Britain, Peter Medawar unravelled the 
mechanisms of rejection after tissue trans- 
plantation. In New York City, Jules Freund 
demonstrated that an emulsion of paraffin oil 
and dead mycobacteria, the causative organ- 
isms of tuberculosis, enhanced and prolonged 
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immune reactions in animal models of 
disease. Burnet, who shared a Nobel prize 
with Medawar in 1960, recognized Freund's 
contribution as key to the experimental study 
of autoimmunity. The subject was now high 
on the agenda of many biomedical scientists. 
Work on subsets of lymphocytes, human- 
tissue antigens and the genetics of histo- 
compatibility factors all contributed to the 
understanding of cellular mechanisms, and 
to devising diagnostic and therapeutic strate- 
gies for an array of disorders. 

But Intolerant Bodies is more than a his- 
tory of scientific concepts, clinical practices 
and experimental activities. It explores fun- 
damental questions such as “What is self?’ 
— how and why the body’s sophisticated 
defence mechanisms fail to recognize its 
cells and tissues and start to destroy them. 
The authors draw on immunological, 
philosophical, psychological and religious 
literature for this intriguing discussion. They 
mine thoughts from the likes of philosopher 
Jacques Derrida and anthropologist David 
Napier for their reflections on the defini- 
tions, boundaries and limitations of self. 

Furthermore, Anderson and Mackay reveal 
an expert understanding of how to use ‘lived 
experience’ to bring a biography of disease 
to life. Personal accounts demonstrate how, 
as theories about the causes of inexplicable 
chronic and debilitating diseases abounded, 
the variety of treatments devised to alleviate 
or ‘cure them expanded. The nineteenth-cen- 
tury German poet Heinrich Heine was treated 
with leeches, sulphur baths and morphine, but 
these did little for an aggressive neurological 
complaint that the authors suggest was multi- 
ple sclerosis. And in 1957, US novelist Flan- 
nery O'Connor, who had lupus, wrote to a 
friend, “Every time something new is invented 
I get in on the ground floor with it. There have 
been fine improvements in the medicine”. But 
steroids also destroyed her bones, a therapeu- 
tic stalemate that made her final years mis- 
erable. Particularly harrowing is the case of 
English diarist W. N. P. Barbellion (the pen 
name of Bruce Frederick Cummings), author 
of the 1919 Journal of a Disappointed Man. 
Barbellion, diagnosed with multiple sclerosis 
in his twenties, began to hoard laudanum and 
to keep a pistol close by. The disease killed him 
less than five years after diagnosis. 

Unlike many biographies, Intolerant 
Bodies has no neat ending. With more than 
80 conditions now considered to be auto- 
immune, diagnosis remains irregular and 
treatments unsatisfactory. Although more 
and more is understood about mechanisms, 
the causative stimuli are still unknown. The 
history of autoimmunity is far from over. = 


Tilli Tansey is professor of the history of 
modern medical sciences at Queen Mary, 
University of London. 

e-mail: t.tansey@qmul.ac.uk 


Books in brief 


Making Space: How the Brain Knows Where Things Are 

Jennifer M. Groh BELKNAP (2014) 

The timing is spot-on for this study of the brain’s navigational system, 
with a Nobel prize awarded to John O’Keefe, May-Britt Moser and 
Edvard Moser this month for their research in the field (See Nature 
514, 154-157; 2014). Neuroscientist Jennifer Groh deftly elucidates 
the mental computations that allow understanding of location 

and boundaries, interweaving well-judged snippets of history. The 
mechanisms, such as the brain’s updates on eye movements, are 
fascinating — as is Groh’s revelation that neurons can “do double 
duty” in tasks such as spatial navigation and memory. 


The Peripheral: A Novel 

William Gibson PUTNAM ADULT (2014) 

With Neuromancer (Ace, 1984), which predicted the rise of the 
Internet, science-fiction writer William Gibson revealed his nose for 
near-future science and for seeing, as he puts it, a future already 
here but not “evenly distributed”. He harnesses that ability anew for 
this techno-thriller. From plastic in the oceans to three-dimensional 
printing, today’s concerns are beautifully repurposed as tomorrow’s 
background. But for Gibson, technological change is just a different 
route to the same problems: his protagonists, who wield the 
humanoid-telepresence ‘peripherals’, are socially marginalized. 


Note-by-Note Cooking: The Future of Food 

Hervé This (transl. M. B. DeBevoise) COLUMBIA UNIVERSITY PRESS (2014) 
Molecular cooking, that scientific wonderland of filter pumps and 

~ INOTE: A rotary evaporators, has evolved. The new revolution, proclaims 

: N OTE physical chemist Hervé This, is “note-by-note cooking” — the 

N COOKING 4 orchestration of pure molecular compounds. Amid the dollops of 

> theory are practical examples, such as chef Pierre Gagnaire’s glucose 
“péligot” caramels (exemplars of contrasting consistency), and 

lists of compounds including the mushroom-scented 1-octen-3-ol. 
Recipes are scant, although Effervescence, a cocktail involving Syrah 
grape polyphenols and ethanol, sounds toast-worthy. 


_— The Perfect Meal: The Multisensory Science of Food and Dining 

The Perfect ro Charles Spence and Betina Piqueras-Fiszman WILEY (2014) 

‘ Anyone who thoughtlessly masticates their way through a meal 

is missing something extraordinary. So might opine experimental 
psychologists Charles Spence and Betina Piqueras-Fiszman, who 
have mined neuroscience and behavioural economics for this 
study of the social, sensory and psychological factors that optimize 
the pleasures of the table. Their exhaustive analyses of everything 
from plate size to flavour incongruities are laced with details such 
as cutlery’s evolution from Victorian marrow scoops and chocolate 
muddlers to today’s textured spoons and “aromatic forks”. 


Bee Time: Lessons from the Hive 

Mark L. Winston HARVARD UNIVERSITY PRESS (2014) 

From the whirr of wings to the whiff of honey, the “full-body 
experience” of working in apiaries has, for biologist Mark Winston, 
sparked insights into humanity’s relationship with nature. In this 
personal and scientific journey into the history we share with bees, he 
ranges over neonicotinoid pesticides and colony collapse, the control 
of African ‘killer’ bees and more. The charismatic social insects 
emerge as both icons of societal cohesion and symbols of nature’s 
paradoxically mingled power and fragility. Barbara Kiser 
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Constructor of a marvellous canon 


George Szpiro weighs up a life of John Napier, who gifted science with logarithms. 


The Lesson, a professor asks a student to 
multiply 3,755,998,251 by 5,162,303,508. 
To his amazement, the answer comes quick as 
a shot: 19,390,002,844,219,164,508. The stu- 
dent declares that she had “learnt all possible 
results of all possible multiplications by heart” 

If only it were that simple. (Indeed, 
the student got it wrong: the answer is 
19,389,602,947,179,164,508.) Multiplica- 
tion is computationally complex. The num- 
ber of mathematical operations required to 
add two numbers grows linearly with the 
number of digits involved, but in multipli- 
cation using simple arithmetic the number 
of operations grows with the square of the 
number of digits. Computer algorithms 
now simplify this, but in centuries past 
multiplications were the bane of scientific 
research. The seventeenth-century astrono- 
mer Johannes Kepler, for instance, had to 
perform thousands of them to compute the 
positions of celestial objects. 

His task would have been much more dif- 
ficult had it not been for a discovery more 
than a decade before by mathematician John 
Napier: logarithms. Tables of these then- 
mysterious numbers meant that two figures 
could be multiplied by adding their logath- 
rithms. In John Napier, maths writer Julian 
Havil traces the life and work of the man who 
presented us with this gift 400 years ago. 


I: Romanian playwright Eugene Ionescos 


Aish 


Napier’s bones, an early calculation 
machine that used logarithms. 


Napier was born in 
Edinburgh in 1550, 
14 years before Gali- 
leo. Little is known 
about his youth and 
education except that 
he travelled, presum- 
ably in Europe, and 
returned to Scotland 
a scholar at 21. He 
became a gentleman 


John Napier: Life, 
Logarithms, and 


farmer, theologian Legacy 
and politician as well JULIAN HAVIL 
ee Princeton University 
as a mathematician. 
Press: 2014. 


A passionate intellec- 
tual, he often paced 
the countryside deep in thought in a dark 
cloak — his academic gear — which led 
neighbours to believe that he was in league 
with the devil. His groundbreaking book on 
logarithms, Mirifici Logarithmorum Canonis 
Descriptio, was published to little fanfare in 
1614. It garnered less attention in his era 
than his 1593 blockbuster on the dangers of 
Catholicism, A Plaine Discovery of the Whole 
Revelation of St. John. 

Descriptio consists of tables of logarithms 
and instructions on how to use them, in 
particular for multiplications and divisions 
in trigonometry, by adding and subtracting 
their entries. Within years of its publication, 
English mathematician William Oughtred 
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used Napier’s logarithms to invent the slide 
rule, which remained indispensable for 
science, engineering and economics until the 
advent of the pocket calculator in the 1970s. 

Napier published Descriptio without 
revealing how he had arrived at the loga- 
rithms. That secret was exposed only in 
Mirifici Logarithmorum Canonis Constructio, 
published after his death by his son Robert 
and friend Henry Briggs (to whom we owe 
the common logarithm to base ten). In Con- 
structio, we find that Napier had discovered 
the numbers by considering two objects 
moving at different speeds along straight 
lines, and establishing a relationship between 
their locations at every moment. The most 
salient feature of logarithms is that they are 
needed to work out x in equations of the form 
10*= 1,000. (In this case, x=log,,(1,000) =3.) 

However, it took more than a century to 
clarify the relationship between logarithms 
and exponents (the power to which a number 
is raised). That was the work of Welsh mathe- 
matician William Jones, famous for being first 
to use 7 for the ratio of a circle’s circumference 
to its diameter. Jones's contribution to Napi- 
er’s work was unearthed only in 1771, long 
after his death, when Royal Society librarian 
John Robertson reported that the Welsh- 
man had formulated the law of logarithms, 
log(ab) = log(a) + log(b). This is exactly the 
relationship that Napier had sought to make 
calculations less tedious. Although developed 
to simplify computations, logarithms turned 
out to be of supreme importance for pure and 
applied mathematics, and for all branches 
of science, from physics and chemistry to 
psychology, information theory, computer 
science and economics. 

Unfortunately, Havil, like Napier, keeps the 
reader in the dark about the essential nature 
of logarithms for most of the book. It would 
have been helpful to introduce logarithms in 
the way taught to most secondary students 
now — as functions that solve for exponents 
in simple equations — and move on from 
there. Furthermore, interminable computa- 
tions, laid out in minute detail often over sev- 
eral pages, may discourage many readers; and 
the narrative suffers from the frequent use 
of long quotations in seventeenth-century 
English. Nevertheless, John Napier fills a gap 
concerning an important, and often ignored, 
chapter of mathematical history. m 


George Szpiro is New York correspondent for 
the Swiss newspaper Neue Ziircher Zeitung. 
e-mail: georgeszpiro@gmail.com 


SCIENCE SOURCE/SPL 


Correspondence 


Can GM marmoset 
use be justified? 


We question the premise that 
genetically modified (GM) 
marmosets are essential to 
Japan's brain-mapping project 
for studying disorders such as 
schizophrenia and depression 
(Nature 514, 151-152; 2014). It 
runs counter to efforts in fields 
such as toxicology to reduce or 
eliminate the use of non-human 
primates in research. 

Our conclusions in the 2011 
Review of Research Using Non- 
Human Primates (see go.nature. 
com/kgy27w), which included 
UK neuroscience research 
between 1997 and 2006, are 
relevant here. The review judged 
such research to be of high 
quality, yet few projects showed 
evidence of biomedical benefit 
— despite being completed many 
years before the review. 

There was a tendency among 
researchers to make over- 
optimistic and unsubstantiated. 
claims for their work and how 
it might improve the treatment 
of brain disorders, presumably 
to strengthen their case for 
funding. We concluded that “the 
size of the problem to which the 
science relates should not be 
accepted as sole justification for 
individual items of research” and 
that “health benefits should only 
be claimed when their potential 
is real”. In some instances, we 
judged that the case for using 
non-human primates — rather 
than other species, including 
humans — was not well made. 

It could be argued that the 
unavailability of suitable GM 
animals at the time might 
account for the poor translation 
into health benefits. But the 
justification for using GM 
marmosets in research today 
needs to be much stronger, 
because of the substantial ethical 
and welfare issues at stake. 

You quote US neuroscientist 
Terry Sejnowski, who proposes 
consideration of “the ethical 
issues that will inevitably arise 
up the road”. We contend that 
these should be considered 


before the journey starts. 
Patrick Bateson Cambridge, UK. 
C. Ian Ragan London, UK. 
ian.ragan1@btinternet. com 


Sustainability: call to 
coordinate actions 


The first step in implementing 
the United Nations’ Sustainable 
Development Goals (SDGs; 
see M. Stafford-Smith Nature 
513, 281; 2014) will be to 
identify critical nodes at which 
issues such as water, energy 
and food need to be addressed 
simultaneously and that have the 
potential to set the agenda. 

For example, schools could play 
a part in increasing water, food 
and energy security by improving 
students’ knowledge of these 
systems as well as of flood and 
drought responses. These nodes 
could help to coordinate existing 
discussion structures into new 
national and local forums, and to 
overcome bureaucratic silos. 

Contribution by governments 
to the SDGs will be voluntary, so 
efforts should focus on increasing 
the numbers of organizations, 
people and networks that can be 
linked to the SDG governance 
system. This would allow the 
system to address inequities in 
gender, geographic location and 
ethnicity, for example, at regional, 
national and local scales. 

Effective monitoring will be 
pivotal to the success of the SDGs. 
If done globally through existing 
UN agencies, there is a risk that 
key resources would be diverted, 
whereas monitoring by individual 
countries might be inadequate 
or inconsistent. The UN High- 
level Political Forum could have 
an important role, but people 
outside the current sustainability 
discussions need to be heard as 
well. They too can help to craft 
context-specific goals and targets, 
maintain ambition and develop 
complex monitoring capacity. 
Norichika Kanie Tokyo Institute 
of Technology, Japan. 
Casey Stevens Providence 
College, Rhode Island, USA. 
kanie@valdes.titech.ac.jp 


Exempt green tech 
from trade rules 


Next year in Paris, parties to the 
UN Framework Convention on 
Climate Change have the task of 
developing a global agreement 
to reduce carbon emissions — a 
successor to the failed Kyoto 
Protocol. They should strike a 
grand bargain with the World 
Trade Organization (WTO) to 
promote green industry. 

For example, the new protocol 
could draw up an open-ended list 
of products and processes that 
would reduce emissions. This 
could then be used to exempt 
countries from the rules of fair 
trade for, say, five years, to allow 
them to build green industries. 

Trade in green industries is 
crucial for transforming fossil- 
fuelled economies. Countries 
trying to promote a green 
economy should not have to 
experience trade disputes (see, for 
instance, go.nature.com/7joyyk). 

There is a precedent for such 
exemptions in the WTO’s General 
Agreement on Tariffs and Trade. 
The negotiation ofa new ‘Paris 
Protocol’ provides a once-in-a- 
generation chance to create green 
industries and to cut emissions. 
John A. Mathews Macquarie 
University, Sydney, Australia. 
john.mathews@mgsm.edu.au 


Engage more cohort 
patients in research 


Input from patients and the 
public is a requirement for 
funding by the UK National 
Institute of Health Research 
(NIHR), but is met by scepticism 
from some academics. 

We analysed 70 birth cohort 
studies to evaluate the basis of this 
scepticism (P. J. Lucas et al. BMC 
Med. Res. Methodol. 13, 56; 2013). 
Such studies rely on recruiting 
and retaining many people over 
many years — often a challenge. 
Retention was a factor in the June 
decision to pause the US National 
Children’s Study, for example (see 
go.nature.com/mfmhhr). 

These problems might 


be overcome by consulting 
participants about research 
design and processes, rather 
than using them solely for data 
provision. But we found very few 
examples. Some scientists feared 
that involving participants could 
compromise data integrity; others 
lacked the right resources or were 
ignorant of the practice. 

There is much to be gained 
from engaging participants 
in research, but persuading 
academics to do so will require 
better support and for other 
funders to follow the NIHR’s lead. 
Patricia Lucas, Maggie Leggett 
University of Bristol, UK. 
Simon Denegri NIHR, London. 
maggie. leggett@bristol.ac.uk 


Translating public 
action into policy 


Last month, millions took to the 
streets in climate rallies organized 
by some 1,500 organizations in 
158 countries. Their message: 
that world leaders at the United 
Nations Climate Summit in New 
York must tackle the challenges 
of global warming head on. 
However, it is by no means clear 
that this will translate into firm 
political pledges (see Nature 
http://doi.org/v3c; 2014). 

For instance, 80,000 people 
demonstrated ahead of the UN 
Climate Change Conference 
in Copenhagen in 2009. At the 
meeting, a political collusion 
between the United States anda 
group formed by Brazil, South 
Africa, India and China ended 
up sidelining the countries most 
in need of action against climate 
change (see A. P. Kythreotis Prog. 
Hum. Geog. 36, 457-474; 2012). 

When it comes to crucial issues 
of social justice and the economy, 
politicians should be guided by 
the people they are supposed to 
serve (see A. Petherick Nature 
Clim. Change 4, 81-83; 2014). 
Chrystal Mantyka-Pringle 
University of Saskatchewan, 
Saskatoon, Canada. 

Andrew P. Kythreotis Cardiff 
University, UK. 
c.mantyka-pringle@usask.ca 
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EVOLUTIONARY DEVELOPMENTAL BIOLOGY 


Ghost locus appears 


The sequences of two sponge genomes provide evidence that the ParaHox developmental genes are older than previously 
thought. This has implications for animal taxonomy and for developmental and evolutionary biology. SEE LETTER P.620 


JAMES 0. MCINERNEY & 
MARY J. O'CONNELL 


narray of transcription factors controls 
Ameen inanimals'. Transcription- 

factor-encoding genes belonging to 
one class — Antennapedia (ANTP) — are 
present throughout the animal kingdom and 
usually have a key role in development. The 
ANTP group includes the Hox, ParaHox and 
NK genes, all of which are paralogues, mean- 
ing that they have arisen in different animals 
from a shared ancestor as a result of gene- 
duplication events. However, the origins, evo- 
lution and, in particular, the timing of these 
duplication events have been unclear. On 
page 620 of this issue, Fortunato et al.’ report 
the analysis of two new genomes from sponges 
of the class Calcarea (the calcisponges) 
that substantially clarifies the evolutionary 
history of the ANTP genes and provides insight 
into tissue structures that are shared across the 
animal kingdom. 

A substantial puzzle has arisen concerning 
the repertoire of ParaHox and Hox genes 
in animals. The first sponge genome to be 
sequenced was that of the demosponge (class 
Demospongiae) Amphimedon queenslandica’. 
Although there seem to be no Hox or Para- 
Hox genes in this genome, the evolutionary 
conservation of clusters of genes known to 
be neighbours of Hox and ParaHox genes in 
other organisms led to the proposal that Hox 
and ParaHox genes were present in the com- 
mon ancestor of all animals but had been lost 
in sponges. The researchers called their idea 
the ‘ghost locus’ hypothesis. However, this 
evidence, although intriguing, was indirect, 
because it was based on an inference of ances- 
tral gene content. 

Despite their simple morphologies, sponges 
from different classes are thought to be quite 
divergent and indeed it is not certain that they 
all form a monophyletic (natural) group*”. If 
they are monophyletic’, then their last com- 
mon ancestor dates to approximately 600 mil- 
lion years ago; if they are a paraphyletic group, 
then the time at which the Calcarea and the 
Demospongiae diverged, for instance, might 
be as much as 800 million years ago*. Either 
way, the antiquity of their divergence means 
that the genomes of a demosponge and a 
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Figure 1 | Evolution of Hox, ParaHox and NK 
genes in animals. Fortunato and colleagues’ 
comparison’ of two calcisponge genomes (phylum 
Porifera) with genome data from other animals, 
including placozoans, cnidarians and bilaterians, 
suggests that an ancestral Antennapedia (ANTP) 
gene duplicated to give rise to the ancestor of 

NK genes and a ‘ProtoHox’ gene. The ProtoHox 
gene subsequently duplicated to give rise to 

the existing Hox and ParaHox gene families. 
Animal diversification occurred after these 
gene-duplication events, but gene losses have 
complicated the picture, explaining, for example, 
the seeming lack of Hox genes in calcisponges and 
of both Hox and ParaHox genes in demosponges. 


calcisponge might be expected to differ 
from one another more than is usual for two 
animals from the same phylum, such that a 
single representative of the sponges can never 
be emblematic of the sponges — the phy- 
lum Porifera — as a whole. Sampling these 
divergent taxa is therefore crucial if we are 
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to understand the evolution of the genetic 
circuitry that effects animal development. 

A ‘superphylum’ called the ParaHoxozoa® 
has been defined on the basis of the presumed 
shared presence of Hox or ParaHox genes. 
This group includes the Bilateria (compris- 
ing all animals with bilateral symmetry), the 
Placozoa (simple, flat animals that live on the 
ocean floor) and the Cnidaria (radially sym- 
metrical aquatic animals, such as jellyfish). 
The animals excluded from the ParaHoxozoa 
are the sponges and the Ctenophora (comb 
jellies), because of their presumed lack of 
these genes. 

Fortunato et al. provide direct support for 
the ghost locus hypothesis, at least for the 
ParaHox genes. By combining data from 
their sponge genomes with existing data for 
phylogenetic, gene-neighbourhood and gene- 
expression analyses, they suggest not only that 
developmental-gene evolution has been very 
different in sponges compared with most 
other animals, but also that early animals did 
indeed contain a repertoire of ANTP genes 
that included the NK and ParaHox genes at 
least (Fig. 1). 

The authors first constructed a phylogenetic 
tree of a large family of ANTP genes. Given 
the great phylogenetic breadth that is spanned 
by the tree, it is unsurprising that high statisti- 
cal support for relationships is not achieved. 
Nonetheless, the tree topology does indicate 
the presence of NK and ParaHox genes in the 
genomes of the calcisponges. 

Next, the researchers demonstrated that 
there is some conservation in the gene neigh- 
bourhoods between these putative ParaHox 
genes in sponges and their presumed equiv- 
alents in bilaterians. Conservation of gene 
neighbourhoods is a hallmark of the evolution 
of ANTP genes, but the authors find that the 
strong conservation observed in most animals 
is not found in sponges. This implies that con- 
servation of ParaHox gene neighbourhoods is 
nota universal rule as was once thought. 

Finally, the authors show that the expression 
patterns of the calcisponge ParaHox genes 
are remarkably similar to those of other ani- 
mals. For example, the choanoderm cell layer 
in sponges has a similar pattern of ParaHox 
expression to the endoderm in bilaterian 
animals, and the pattern in the sponges’ 
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cross cells is like that of sensory cells in other 
animals. This opens up the possibility that 
certain tissue types in sponges might have a 
common ancestry with other tissue types in 
bilaterian animals. However, it should be kept 
in mind that homology in developmental genes 
does not always equate to homology of organs’. 

One implication of Fortunato and colleagues’ 
findings is that, although the ParaHoxozoan 
superphylum may still be a useful taxonomic 
group, it can no longer be defined by the 
possession of ParaHox or Hox genes. In addi- 
tion, the results suggest that the origin of the 
ParaHox genes is earlier than was previously 
thought and that the first animals probably 
had a ParaHox gene repertoire (Fig. 1). This 
idea leads to the intriguing question of whether 
the common ancestor of all animals was in fact 
more developmentally complex than present- 
day sponges, cnidarians and placozoans, and 


ASTROPHYSICS 


that these groups have lost complexity, rather 
than that complexity has been gained in other 
animal lineages. 

Our existing sample of animal genomes has 
heavily influenced our current ideas about 
ANTP gene evolution. There are two other 
classes of sponge, the Homoscleromorpha*® 
and the Hexactinellida’, and a draft genome 
sequence for the homoscleromorph Oscarella 
carmela is available’. Investigating its ANTP 
gene repertoire should shed further light on 
the evolution of ParaHox and Hox genes. 
More generally, the sequencing of additional 
genomes from organisms at the sparsely 
sampled base of the animal phylogenetic tree 
will be key to refining our understanding of 
the changing complexity of animals over the 
course of evolution. = 


James O. McInerney is in the Department 


Secret ingredient 


exposed 


Astronomers have suspected for some time that magnetic fields are a key 
ingredient in the accretion of material that surrounds young stars. New 
observations have just begun to reveal these fields in action. SEE LETTER P.597 


CHRISTOPHER M. JOHNS-KRULL 


ost stars are born surrounded by 
Mie of gas and dust, and it is in 

these disks that planets form. How- 
ever, the gas and dust that makes up these 
disks (Fig. 1) does not all go into forming plan- 
ets. Indeed, much of the disk material slowly 
falls, or accretes, onto the newly formed star, 
setting its final mass. For many years, astrono- 
mers have studied such accretion disks and 


measured the rate at which the material accretes 
onto young stars, but exactly why such accre- 
tion occurs so efficiently has remained elusive. 
Over the past several years, researchers have 
started to zero in on a solution in theoretical 
studies for the evolution of an accretion disk 
that involves the action of magnetic fields. 
Unfortunately, until just this year there were 
no observations of the required magnetic 
fields in the disks. On page 597 of this issue, 
Stephens et al.' report how they have now clearly 
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detected magnetic fields in one such disk. 

The key to understanding how an accretion 
disk works is to figure out what causes some 
of the material to spiral inward and eventually 
merge onto the star. Viscosity in the disk is one 
possible mechanism. At each point in the disk, 
the material slightly inside this position moves 
around the star just a bit faster than the mate- 
rial just outside the given point. This difference 
in speed results in viscosity, a type of friction, 
that slightly slows down the material on the 
inside, causing it to fall somewhat closer to 
the star. This viscosity, acting at all radii in the 
disk, leads to the accretion of some of the disk 
material onto the central star. The greater the 
viscosity, the more rapid the accretion of mater- 
ial. The trouble is, although astronomers can 
estimate the viscosity in disks on the basis of the 
temperature and density of the material there, 
the result fails by many orders of magnitude to 
produce the rate of accretion that is observed””. 

Magnetic fields, through a process known 
as the magnetorotational instability’, have 
been proposed as a possible means by which 
disks accrete material. If the material in the 


Figure 1 | Accretion disks around newly formed stars. These visible-light images show disks of gas and dust (dark rings) around four infant stars (central 
bright regions) in the Orion Nebula, a star-forming molecular cloud about 400 parsecs away from Earth. The dust in the disks, which are surrounded by hot gas 
from the nebula, makes them look dark at visible wavelengths. Magnetic fields threading these disks, now observed directly by Stephens et al.' in the disk of 

the young star HL Tau, are probably the main element causing much of the observed disk material to accrete onto the stars. Each square region shown is about 


260 billion kilometres across. 
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disk is threaded through by magnetic fields, 
then different regions of the disk will be con- 
nected to each other by these fields. Although 
the temperature of the disk is relatively low 
compared with that of the star, the material is 
warm enough for a few ions and free electrons 
to form. The radiation from the star can also 
cause ion formation. Ions are forced to move 
with magnetic fields — essentially, they are tied 
to the field lines. Two ions at slightly different 
radii in the disk, tied to the same field line, will 
exert forces on each other that are transmit- 
ted through the magnetic field. Effectively, the 
magnetic fields act as springs holding the ions 
together. The ion closer to the star should be 
moving faster in its orbit than the ion slightly 
farther out, but the field tying them together 
causes the inner particle to slow down alittle 
and the outer particle to speed up, in exactly 
the same way as the viscosity described above. 
However, this magnetorotational-instability 
mechanism is much better at causing accre- 
tion than standard viscosity, and so seems 
to be a promising solution to the problem of 
how accretion disks work. But the question 
has been: are there magnetic fields in the disks 
around young stars? 

Yes, according to Stephens and colleagues’ 
study. The authors detected the magnetic fields 
in the disk of a young star by using the Com- 
bined Array for Research in Millimeter-wave 
Astronomy (CARMA) to measure the polari- 
zation of the light (the preferred direction in 
which the light’s electric field oscillates) that is 
emitted by the dust in the disk. Dust particles 
in interstellar space and in the disks of young 
stars are not perfect spheres — they are some- 
what oblong. They also spin, and usually have 
a small amount of electric charge, like the ions 
that are present in the gas. The result of their 
spinning oblong shape and electric charge is 
that the dust grains generally align their long 
axes perpendicular to the magnetic field’. This 
then means that they will emit polarized light. 

It is more likely that the light emitted by a 
dust grain will have its electric field aligned 
with the long axis of the grain. If the dust grains 
all have random alignment relative to each 
other, the resulting total emission of radia- 
tion will have no preferred orientation and no 
polarization. By contrast, if most or all of the 
dust grains are aligned with each other owing 
to a magnetic field, the total emission will be 
polarized. This is exactly what Stephens et al. 
saw in the disk surrounding the young star 
HL Tau. Furthermore, the authors were able 
to spatially map the disk around HL Tau and 
measure the polarization at several locations 
in the disk, making these observations unu- 
sually powerful for testing our understanding 
of accretion disks. Similar observations were 
also made in 2014 of an even younger star®; 
however, the orientation of its disk and poten- 
tial confusion from the surrounding envi- 
ronment did not allow as clear a study of the 
magnetic-field structure as that obtained by 


Stephens et al. in the disk of HL Tau. 

This result is exciting because it strongly 
suggests that magnetorotational instability 
may be the long-sought answer to just how 
accretion disks work. However, this is far 
from the last word on this astrophysical puzzle. 
The theoretical models of this phenomenon 
generally predict that the magnetic fields in 
the disk should be wrapped around the star, 
basically following the motion of the disk 
material as it orbits the star. Unfortunately, this 
is not exactly what Stephens and colleagues 
observed. Their polarization measurements 
of HL Tau’s disk indicate that the magnetic 
fields are all more or less pointing in the same 
direction instead of wrapping around the star. 
This is a mystery with no good explanation at 
this time. 

So what does the future hold for accretion- 
disk physics? A new, more sensitive radio- 
telescope array than the one used by Stephens 
and colleagues, the Atacama Large Millimeter/ 
submillimeter Array (ALMA) in Chile, has 
begun operations. This facility will provide 
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detailed views of the structure and magnetic 
fields of disks around young stars. We can hope 
that this will produce observational results that 
will also spur new theoretical investigations 
into the role of magnetic fields in accretion 
disks. Our understanding of this phenom- 
enon is not complete yet, but it has just taken a 
dramatic step forward. m 
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Enzyme-chromatin 
complex visualized 


The structure of an enzyme that is bound to a nucleosome — a protein complex 
around which DNA is wrapped — reveals how contacts between the two orient the 
enzyme so that it can modify a specific amino-acid residue. SEE ARTICLE P.591 


JURG MULLER & CHRISTOPH W. MULLER 


ina highly compact protein-DNA complex 
called chromatin. The basic building block 
of chromatin is the nucleosome, an eight-sub- 
unit core comprised of two copies of each of 
the histone proteins H2A, H2B, H3 and H4, 
around which DNA is wrapped in two helical 
turns. Many nucleosome-remodelling and his- 
tone-modifying enzymes modulate chromatin 
structure and thereby affect the transcription, 
replication and repair of gnomic DNA. How- 
ever, despite the importance of these enzymes, 
we have only a limited understanding of how 
they interact physically with nucleosomes. On 
page 591 of this issue, McGinty et al.' describe 
the crystal structure of the H2A-modifying 
module of an enzyme called Polycomb repres- 
sive complex 1, bound to a nucleosome. For the 
first time, the structure reveals how a histone- 
modifying enzyme recognizes its nucleosomal 
substrate. 
Polycomb repressive complex 1 (PRC1) acts 
with other protein complexes of the Polycomb 
group to repress the transcription of many of 


lE the nucleus, genomic DNA is organized 
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the genes that control developmental processes 
in animals. PRC1 can compact chromatin and 
inhibit nucleosome remodelling through a 
non-enzymatic mechanism”. But the complex 
also has ‘E3’ ubiquitin-ligase enzyme activity, 
which links a single ubiquitin molecule to an 
amino-acid residue, lysine (Lys) 119, in H2A° 
—a process called monoubiquitylation. The 
function of this protein modification in tran- 
scriptional repression is not well understood. 
However, studies published this year*” found 
that Lys 119 ubiquitylation creates a binding 
site for one form of the histone-methyltrans- 
ferase enzyme complex PRC2, and thereby 
promotes the addition of three methyl groups 
to H3 at Lys 27, a modification that is crucial 
for the transcriptional repression of Polycomb 
target genes. 

Six forms of mammalian PRC1 have been 
identified. All contain two proteins that have 
RING fingers — domains that bind zinc and 
are often found in proteins involved in ubiq- 
uitylation. One protein is a subunit of E3 
ubiquitin ligase, either Ring1B or the closely 
related protein Ring1A, and the other is either 
the Polycomb-group RING finger (PCGF) 


DNA 


Figure 1 | Enzyme-nucleosome contacts dictate substrate specificity. 

a, This schematic diagram shows the disc face of the nucleosome and the 
acidic patch formed by histone proteins H2A and H2B. The H24A lysine 

(Lys) amino-acid residues Lys 119, Lys 124, Lys 127 and Lys 129 are indicated 
by yellow circles. b, McGinty et al.' report that the Ring1B-Bmil E3 


protein Bmil or one of five other PCGF-related 
proteins®. In vitro studies’ have established 
that Ring1B alone shows only poor E3 activ- 
ity on nucleosomes, but that a Ring] B-Bmil 
complex functions as an E3 ligase, and acts with 
the ‘E2’ enzyme UbcH5c to monoubiquitylate 
nucleosomal H2A at Lys 119 (E2 enzymes pro- 
vide the ubiquitin molecule for ubiquitylation). 

Building on previous structural studies’ ’°, 
McGinty and colleagues obtained crystals 
of the human Ring1B-Bmil-UbcH5c com- 
plex bound to a nucleosome. They found 
that the complex binds to the flat sides of the 
nucleosome disc (Fig. 1), forming a crescent 
shape. Half of the crescent, the Ring1B-Bmil 
RING-finger dimer, binds the central histone 
surface, and the other half, UbcH5c, is posi- 
tioned directly above the carboxy-terminal 
end of H2A, which contains the target of the 
complex, Lys 119. The researchers’ structure 
does not contain the ubiquitin molecule that 
is transferred to H2A, but the location of the 
active-site cysteine amino acid of UbcH5c sug- 
gests that only Lys 119 or the adjacent residue, 
Lys 118, would be close enough to become 
ubiquitylated. 

Although UbcH5c is in contact with the 
nucleosomal DNA, it barely touches the his- 
tones in the nucleosome. Rather, the precise 
orientation of UbcH5c is controlled by the 
position of the Ring1B-Bmil dimer on the 
nucleosome surface. Here, a recurring theme 
of nucleosome recognition comes into play. 
McGinty and colleagues report that Ring1B 
uses a positively charged arginine amino-acid 
residue as an anchor to contact an ‘acidic patch’ 
formed by H2A and H2B. The acidic patch is 
implicated in nucleosome-nucleosome inter- 
actions"’, and has also been identified as the 
crucial interaction site in all other nucleo- 
some-protein complexes that have been struc- 
turally determined so far. As the authors note, 


UbcH5c 


Lys 127 and Lys 129. 


LANA peptide and RCC1, CENP-C and Sir3 
proteins all use an arginine residue to make 
contact with this acidic patch (see Fig. 4 of the 
Extended Data for the paper’). 

In the case of the Ring1B-Bmil complex, 
McGinty et al. found that other positively 
charged Ring1B residues also interact with 
the acidic patch, extending the Ring1B- 
nucleosome interaction surface. Never- 
theless, a mutational analysis performed by 
the authors shows that the arginine anchor is 
primarily responsible for binding. Mutating 
this residue resulted in a 50-fold decrease in 
binding affinity and H2A monoubiquitylation, 
whereas mutating adjacent positively charged 
residues had less-dramatic effects. The Bmil 
subunit seems to make a lesser contribution 
to nucleosome binding affinity, although 
contact with the amino-terminal helix of H3 
is crucial for binding, and presumably also for 
fixing the orientation of the complex on the 
nucleosome surface. 

What predictions can be made on the basis 
of the authors’ structure? The BRCA1-BARD1 
RING-finger dimer is an E3 ligase that is 
related to Ring1B-Bmil, but BRCA1-BARD1 
ubiquitylates Lys 124, Lys 127 or Lys 129, 
which are located nearer to the C terminus 
of H2A than residues modified by Ring1B- 
Bmil (ref. 11). McGinty and colleagues found 
that, like Ring1B-Bmil, the BRCA1-BARD1 
dimer seems to act with UbcH5c, and that 
the nucleosome-binding region of Ring1B 
and the corresponding region in BRCA1 are 
evolutionarily conserved — a confirmation 
of findings from another study’. Moreover, 
they show that mutation of the acidic patch 
eliminates BRCA1-mediated H2A mono- 
ubiquitylation in nucleosomes. These obser- 
vations make it likely that BRCAI1-BARD1 
binds the acidic patch in a similar fashion to 
Ring1B-Bmil. However, the overall geometry 
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Lys 127 
ys 124 


ubiquitin-ligase enzyme, which is part of the Polycomb repressive complex 
PRC1, positions another enzyme, UbcH5c, just above Lys 119, enabling 
monoubiquitylation of H2A at this residue. c, A hypothetical model of how 
the BRCA1-BARD1 E3 ligase positions the UbcH5c enzyme above Lys 124, 


probably differs, such that only H2A residues 
Lys 124, Lys 127 or Lys 129 can become modi- 
fied (Fig. 1). 

Only some of the six different Ring1B- 
PCGF complexes seem to be able to mono- 
ubiquitylate H2A at Lys 119 in vivo’. This raises 
the possibility that one or more subunits of the 
other PRC1-type complexes occupy different 
interfaces on the E3 ligase or on the nucleo- 
some in a way that precludes either the binding 
of an E2 enzyme or the transfer of ubiquitin to 
H2A. Undoubtedly, McGinty and colleagues’ 
study will serve as the foundation for future 
structural investigations that will resolve how 
other E3 ligases or larger PRC1 assemblies 
interact with nucleosomes. m 
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Radicals promote 
magnetic gel assembly 


Engineering complex tissues requires high-throughput, three-dimensional 
patterning of materials and cells. A method to assemble small gel components 
using magnetic forces from encapsulated free radicals could be just the ticket. 


CHRISTOPHER B. RODELL 
& JASON A. BURDICK 


he ability to control the micrometre- 

| scale assembly of water-swollen polymer 
networks known as hydrogels has great 
potential in many disciplines. It might, for 
example, advance the development of robots 
from soft materials, of lab-on-a-chip diagnostic 
devices and of tissue-engineering constructs. 
Writing in Nature Communications, Tasoglu 
et al.' describe just such an ability. They have 
used paramagnetic forces (the attraction of 
certain materials to an external magnetic 
field) to guide the two- and three-dimensional 
assembly of micrometre-sized hydrogel subu- 
nits called microgels. Like the blocks in the 
game Tetris, these subunits can be turned and 
manipulated to assemble a desired structure. 
Because the technique can simultaneously 
accommodate several types and compositions 
of material in the presence of cells, it is highly 
appealing for tissue-engineering applications. 
Tissue engineering depends on a precise 
interplay between cells and their surroundings, 
and on the hierarchical organization of three- 
dimensional (3D) tissues. Typical behaviours of 
cells, such as proliferation, differentiation and 
migration, are controlled in part through spa- 
tially and temporally distributed signals from 
the local surrounding matrix (the cell niche), as 
well as by soluble signals that diffuse through it. 


Figure 1 | Radical magnetic microgel assembly. Tasoglu et al.' describe a 
method for assembling micrometre-sized hydrogels (water-swollen polymer 
networks) into larger constructs with defined compositions and structures. The 
microgels are formed with or without encapsulated cells, and then a stable radical 
compound (yellow circles) is allowed to diffuse into them. The radicals generate 


Radicals os 


Magnetic 
Radical assembly, 
diffusion stabilization 


Researchers have used a variety of fabrication 
and patterning approaches to introduce these 
signals into synthetic materials in an attempt to 
understand and recapitulate cell-matrix inter- 
actions on multiple size scales. 

Scaffolds for tissue engineering are pro- 
duced by either encapsulating cells within, or 
seeding them into, materials. Historically, such 
scaffolds had uniform composition and did not 
replicate the complexity of, for example, devel- 
oping tissues or the wound-healing response. 
Many researchers have therefore expanded 
their methods to incorporate top-down design 
— the production of a uniform material that is 
subsequently patterned. In one such approach 
(photopatterning), spatially controlled expo- 
sure of materials to light generates changes in 
biomolecule presentation”, mechanical cues’, 
or both’, to control cell behaviour through 
interactions with the matrix. 

Although top-down approaches provide 
some control over uniform cell distributions 
in a single material, many tissues are highly 
non-uniform in matrix composition and cell 
distribution. Bottom-up construction meth- 
ods, in which tissue constructs are assembled 
from smaller components, may thus be better 
suited for replicating biological complexity. 
3D printing, in which the direct deposition 
of material creates precise 3D structures, 
embodies this strategy. Recent advances 
in technology have allowed 3D printing of 
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tissues through the deposition of cellular 
aggregates or cell-laden materials’. However, 
these processes still rely on repeated layer-by- 
layer deposition — a method’ reported in 1986 
—and thus require long fabrication times that 
currently limit construct size. 

Assembly of preformed microgel compo- 
nents may offer an alternative method, because 
this would allow rapid assembly and therefore 
the potential formation of tissue constructs at 
clinically relevant scales. So far, the assembly 
of such components has been achieved largely 
through passive thermodynamic processes’ 
or direct serial manipulation®. To improve 
efficiency, attention has recently turned to 
methods that drive assembly through exter- 
nal forces. In their approach, Tasoglu and 
colleagues used a chemical component 
that can be easily incorporated into nearly 
any hydrogel through diffusion (Fig. 1), 
and which contains a stable free radical (an 
unpaired electron). When exposed to an 
external magnetic field, the encapsulated 
radicals exert a paramagnetic force, which 
enables hydrogel components to be easily and 
rapidly manipulated into a desired location. 

The authors show that the magnetization 
process can be temporally controlled and is 
compatible with cell viability and proliferation. 
Moreover, the radical-containing hydrogels 
are magnetically responsive for up to 24 hours 
after encapsulation of the radicals. However, 
the materials can be quickly rendered magneti- 
cally inactive by treatment with the antioxidant 
vitamin E, which acts as a radical scavenger. 
The researchers observed that vitamin E 
treatment tended to improve the viability and 
proliferation of encapsulated cells when com- 
pared with untreated gels. 

To demonstrate the general applicability 
of their method, Tasoglu et al. have used it 
to form constructs containing many mater- 
ial types, densities and porosities. They also 
show that, following the assembly of hydro- 
gel components into a desired configuration, 
another hydrogel may be added as a precursor 


Vitamin E 


forces in the presence of an external magnetic field, driving microgel motion and 
assembly. The resulting constructs are stabilized by the addition ofa solution of 
a hydrogel precursor that polymerizes on exposure to ultraviolet light. Finally, 
the radicals are quenched (converted to non-radical products) by the addition of 
vitamin E, to prevent them from damaging the encapsulated cells. 


solution that is solidified by ultraviolet-light- 
initiated polymerization — a process that 
stabilizes the assembled construct. Although 
not demonstrated, the method should also be 
able to control the assembly of components that 
provide different stiffnesses, biochemical cues 
or other biologically relevant features. Such 
generality is important for tissue-engineering 
applications. 

Although tissues are inherently 3D, most 
microgel-assembly methods have been 
limited to manipulation in two dimensions. 
By contrast, the authors demonstrate that they 
can levitate many microgel components, and 
simultaneously drive the assembly of truly 3D 
structures of up to a few millimetres across 
by arranging external magnets in appropriate 
orientations. Massively parallel assembly in 
three dimensions will be needed to achieve the 
formation of larger tissue constructs. 

In principle, the new method addresses 
several major challenges in tissue engineering. 
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However, practical applications will be realized 
only when guided assembly of constructs can 
occur on the tissue scale. Incorporating molec- 
ular recognition between hydrogel subunits’, 
to automate the stabilization of 3D assem- 
blies, might help to achieve this. Moreover, it 
will probably be necessary to pattern multiple 
cell types and to introduce a means of perfus- 
ing thicker constructs to provide oxygen and 
nutrients for long-term cell viability. 

Nevertheless, Tasoglu and colleagues’ 
method is likely to stimulate the growing inter- 
est in guided micro-assembly. The use of exter- 
nal driving forces, such as magnetism, allows 
previously inaccessible levels of parallel assem- 
bly and might therefore propel this bottom-up 
approach to clinical use. Before then, however, 
the method will probably have a more direct 
impact on the formation of smaller assemblies, 
such as lab-on-a-chip devices for diagnostic 
applications and organized co-culture systems 
for studies of cell-cell interactions. m 


Cell plasticity helps 
hearts to repair 


Fibroblast cells are known as key players in the repair of damaged heart 
structures. New findings show that injury also induces fibroblasts to become 
endothelial cells, helping to mend damaged blood vessels. SEE ARTICLE P.585 


TORU MIYAKE & RAGHU KALLURI 


eart attacks caused by a blockage in 
H:: coronary artery induce severe 

injury to cardiac muscle cells, lead- 
ing to cell dysfunction and death. The damage 
elicits repair and regenerative responses that 
provoke the removal of dying cells and cell 
debris, recruit immune cells and initiate the 
formation of new blood vessels to recover 
blood supply. Fibroblast cells play a central 
part in this repair response. In this issue, Ubil 
et al.’ (page 585) describe how the plasticity of 
cardiac fibroblasts contributes to this process, 
by showing that fibroblasts can, in response 
to the activity of the transcription factor p53, 
convert into the endothelial cells that line the 
interior surface of blood vessels. 

The heart consists of myocytes (muscle 
cells) and non-myocytes, which include 
cardiac fibroblasts and endothelial cells. The 
fibroblasts produce growth factors and extra- 
cellular matrix (ECM) proteins to maintain 
proper cardiac architecture, contraction and 
function’. They also interact with endothelial 
cells and myocytes to aid angiogenesis (blood- 
vessel formation) and maintain physiologi- 
cal homeostasis’. Following heart damage, 


cardiac fibroblasts are activated to produce 
ECM proteins and soluble factors to compen- 
sate for structural defects, contain the spread of 
damage, reinforce cardiac stiffness and prevent 
cardiac rupture. These activities, collectively 
referred to as fibrosis, aid in remodelling the 
heart musculature. Controlled fibrosis is cru- 
cial for restoring cardiac function after injury. 
However, excessive fibrosis is considered a 
pathological process that can lead to adverse 
effects, including reduced cardiac stiffness 
(diastolic dysfunction) and irregular electri- 
cal connectivity (arrhythmia). 

Although endothelial cells in blood vessels 
are typically thought of as terminally differen- 
tiated cells, they can take on the characteristics 
of mesenchymal cells**, which are gener- 
ally mobile cells surrounded by interstitial 
ECM proteins. During this endothelial- 
to-mesenchymal transition (EndMT), the 
endothelial cells lose the tight junctions that 
hold neighbouring cells together, and gain 
the ability to move, produce ECM proteins 
and contribute to excessive fibrosis, while 
also depleting functional capillaries and the 
endocardium tissue layer. EndMT is induced 
in cardiac endothelial cells by signalling 
pathways that depend on the growth factor 
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TGF-61 and that can be reversed by the activity 
of the protein BMP7 (ref. 4). 

Now, Ubil and colleagues show that, 
following acute cardiac injury, fibroblasts 
(which belong to the mesenchymal cell lineage) 
can undergo a reverse conversion — from 
mesenchymal to endothelial cells (MEndT) 
— and become components of blood vessels 
(Fig. 1). To study this plasticity, they used mice 
in which cells that gain or lose expression of 
cell-type-specific markers can be tracked by 
fluorescence, a technique called genetic fate 
mapping. 

The authors induced ischaemia-reperfusion 
injury by blocking the coronary artery and 
then restoring blood flow in the hearts of 
these mice. Three days later, they found that 
35% of fibroblasts in the injury zone expressed 
the endothelial marker VECAD and were 
located in the interior of the vessel. Of these 
fibroblast-derived endothelial cells, 41% 
took up acetylated low-density lipoprotein, 
which is suggestive of endothelial-cell func- 
tionality. Most of the cells undergoing this 
transition expressed the fibroblast markers 
Colla2 or FSP1, whereas very few expressed 
aSMA, a marker shared by a subset of cardiac 
fibroblasts (myofibroblasts) and the mes- 
enchymal cells generated through EndMT 
(Fig. 1). This finding highlights a functional 
heterogeneity of recruited fibroblasts in injured 
cardiac tissue with respect to their plasti- 
city. Understanding this heterogeneity will 
require future studies using fate mapping of 
myofibroblasts®. 

Ubil and co-workers also found that the 
fibroblast-derived endothelial cells in the mice 
express increased levels of p53, a transcrip- 
tion factor known for its multiple functions, 
including regulation of the cell cycle, apoptotic 
cell death and DNA repair. To investigate the 
involvement of the p53 signalling pathway 
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Figure 1 | Cardiac-cell conversions. The heart's response to injury involves the proliferation and 
activation of cardiac fibroblasts (a type of mesenchymal cell). Although this fibrosis is essential for 
repair, an excessive response can lead to cardiac dysfunction, and evidence is building that this balance 
is regulated by transitions between cell types. Previous studies*’ have shown that endothelial cells lining 
blood vessels can convert to mesenchymal-like cells in a process called the endothelial-to-mesenchymal 
transition (EndMT). These cells, which express aSMA, a marker shared by a subset of cardiac fibroblasts 
(myofibroblasts), contribute to increased fibrosis. Now, Ubil et al.' show that some cardiac fibroblasts 
(distinguished by the markers FSP1 or Colla2) can undergo a reverse, mesenchymal-to-endothelial 
transition (MEndT). This conversion is induced by activity of the transcription factor p53, and leads to 


reduced fibrosis and increased blood-vessel formation. 


in the MEndT program, the authors studied 
cardiac fibroblasts cultured in vitro under 
serum deprivation — a stress condition that 
induces upregulation of p53. The cells formed 
tubular structures reminiscent of endothelial- 
cell architecture and expressed endothelial 
markers, including VECAD and transcrip- 
tion factors such as HoxA9 and HoxD3. 
However, the fibroblasts did not generate 
these tubules without serum deprivation, 
even when p53 was artificially overexpressed, 
suggesting that p53 expression alone is not 
sufficient to induce MEndT and that other 
signalling pathways are involved in launching 
this program. 

The authors went on to show that treating 
mice with the small molecule RITA, which 
enhances p53 signalling, for three days after 
cardiac injury increased the number of fibro- 
blast-derived endothelial cells. The treatment 
also accelerated angiogenesis and decreased 
cardiac fibrosis, leading to an improvement 
in cardiac function. These in vitro and in vivo 
findings suggest that p53 expression in fibro- 
blast-derived endothelial cells has a key role 
in the recovery of cardiac function following 
injury. 

Administration of other p53-activator mol- 
ecules has previously been shown to impair 
angiogenesis, increase apoptotic cell death 
and cause dysfunctional muscle contraction 
in a mouse model of a condition called cardiac 
hypertrophy’. Conversely, Ubil et al. show 
no increase in myocyte p53 expression or the 
number of apoptotic cells after RITA treat- 
ment compared with control mice. These 
contrasting results could be explained by the 


different cardiac-injury models and drugs used, 
which will need to be considered in investi- 
gations of the potential clinical application 


of p53-inducing drugs for targeting cardiac 
fibrosis. 

Nevertheless, Ubil and colleagues’ study 
provides insight into the cardiac-repair pro- 
cess and highlights potential new therapeutic 
strategies. It also adds to the debate of whether 
the term ‘terminally differentiated’ in adult 
tissue might be too confining when cellular 
plasticity is rampant and seemingly func- 
tional in situations of disease or injury, just as 
in embryonic development. = 
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Starve a fever, 
feed the microbiota 


A study finds that the cells lining the gut are modified in response to systemic 
infection, increasing the host’s tolerance to infection in a manner that is 
dependent on the microorganisms that inhabit the gut. SEE LETTER P.638 


SETH RAKOFF-NAHOUM & 
LAURIE E. COMSTOCK 


ammals have a mutualistic relation- 

ship with the consortium of microor- 

ganisms that inhabit their intestines, 
collectively known as the gut microbiota. The 
microbiota profits from an environment that 
is rich in dietary and host-derived nutrients, 
and provides its host with numerous benefits, 
ranging from immune and digestive capabili- 
ties to resistance to pathogen colonization. On 
page 638 of this issue, Pickard et al.’ describe 
another facet of this mutualism. They show that 
ina model of systemic infection, the host modi- 
fies the surface molecules of the epithelial cells 
lining its intestine, which increases host fitness 
in a microbiota-dependent manner. 
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The authors began their study with the 
question of how the beneficial microbiota is 
maintained during the period of diminished 
food consumption — known as anorexia — 
associated with systemic infection in the host. 
Host-derived sugars such as fucose are 
present on the intestinal epithelium and serve 
as an alternative to dietary food sources for the 
gut microbiota. Pickard and colleagues there- 
fore evaluated fucosylation (the addition of 
fucose to molecules that are then secreted to 
the cell surface) of the small intestine following 
systemic administration of bacterial-derived 
molecules, such as lipopolysaccharide (LPS), 
which are recognized by Toll-like receptor 
proteins (TLRs). This administration mimics 
a bacterial infection and induces symptoms of 
sickness, including anorexia. The researchers 


showed that systemic LPS administration leads 
to rapid and sustained fucosylation of glyco- 
proteins throughout the small intestine. These 
fucosylated glycoproteins are released into the 
intestinal lumen where the fucose residues 
are liberated and consumed by the resident 
bacteria in the colon. 

Pickard et al. found that systemic admini- 
stration of LPS induces release of the cell- 
signalling molecule interleukin-23 (IL-23) from 
dendritic cells of the innate immune system, 
which in turn activates innate lymphoid cells to 
release IL-22. This induces expression of a spe- 
cific fucosyltransferase gene, Fut2, in epithelial 
cells of the small intestine, resulting in addition 
of fucose to their surface molecules (Fig. 1). 

Next, the authors demonstrated that mice 
lacking Fut2 recovered weight more slowly 
after LPS-induced anorexia than did control 
mice, indicating that under these conditions, 
fucosylation is beneficial to the host. Further- 
more, this benefit is dependent on the pres- 
ence of the microbiota, because germ-free 
or microbiota-depleted mice also showed 
impaired weight gain in LPS-induced ano- 
rexic conditions, despite the fact that their 
epithelial cells were fucosylated. 

Finally, to investigate whether fucosylation 
improves the host’s fitness during infection, 
Pickard and co-workers infected Fut2-defi- 
cient or control mice with the intestinal bac- 
terial pathogen Citrobacter rodentium. They 
showed that inducible fucosylation in response 


a_ Steady-state 
condition 


to LPS is crucial for limiting both proliferation 
of colonic cells (hyperplasia), which is indica- 
tive of tissue damage, and weight loss during 
infection. This benefit is probably not due to 
increased resistance to infection, because the 
total levels of C. rodentium in the small intes- 
tine were similar in mice with or without Fut2. 
Because the damage to the host is independ- 
ent of pathogen burden, it is probable that this 
microbiota-dependent effect improves host 
fitness through disease tolerance””. 

Recently, Goto et al.’ identified a molecular 
pathway involved in microbiota-triggered fuco- 
sylation on the small intestinal epithelium in 
normal, ‘steady-state’ conditions, which shares 
similarities with the systemic-LPS-inducible 
pathway described by Pickard and colleagues. 
Both studies provide evidence that fucosylation 
is dependent on a subset of IL-22-producing 
innate lymphoid cells. Similar to Pickard et al., 
Goto and colleagues show that fucosylation is 
important for host defence against an intestinal 
bacterial pathogen, Salmonella typhimurium. 
However, in this second study, fucosylation 
inhibited bacterial invasion of intestinal tis- 
sues and, therefore, host protection seems 
to be due to pathogen resistance rather than 
disease tolerance. 

An apparent discrepancy in Pickard and 
colleagues’ study is that they did not detect 
fucose on small intestinal epithelial cells 
in mice that had not been treated systemi- 
cally with LPS, whereas Goto and colleagues’ 
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Figure 1 | Making a meal of disease. a, In steady-state conditions, fucosylation (the addition of fucose 
groups to the glycan chains of molecules secreted to the cell surface) is not induced in the small intestine. 

b, Pickard et al. injected mice with lipopolysaccharide (LPS), which mimics systemic infection, including 
diminished food consumption. They report that this leads to the release of the cell-signalling molecule 
interleukin-23 (IL-23) from dendritic cells of the immune system. IL-23 drives IL-22 production from innate 
lymphoid cells, inducing expression of the gene encoding the fucosyltransferase enzyme Fut2 (not shown) 

in the epithelial cells of the small intestine. Fut2 catalyses fucosylation, and fucose-containing molecules are 
then released into the gut lumen where they transit to the colon. Here, fucose is liberated and consumed by 
members of the microbiota. This process correlates with an increased tolerance to infection in the host. 
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and other studies”® showed steady-state 
fucosylation in the distal small intestine of 
mice. This may be accounted for by differences 
in the intestinal microbiota of the mice used in 
each study. Pickard and co-workers used mice 
from the Jackson Laboratory whose micro- 
biota does not contain segmented filamentous 
bacteria’, which Goto et al. identify as potent 
inducers of fucosylation. It will be interesting to 
determine the factors derived from the micro- 
biota (and, in particular, from the segmented 
filamentous bacteria) that mediate intestinal 
fucosylation, which may act independently of 
Toll-like receptors’. It is likely that the steady- 
state fucosylation of the distal small intestine 
observed by Goto and colleagues would be 
augmented if mice were injected with LPS, 
because the fucosylation reported in the 
current study’ occurred not only in this distal 
region, but throughout the small intestine. 

Perhaps the most exciting aspect of Pickard 
and colleagues’ work is what it adds to our 
understanding of beneficial interactions 
between the host and its microbiota. Although 
there is a growing list of host factors, cells and 
conditions that shape the microbiota, little is 
known about whether, under certain condi- 
tions, the host can select for a beneficial micro- 
biota that increases the host's own fitness. This 
study suggests a mechanism by which such 
selection may occur. An outstanding question 
is how the microbiota contributes to disease 
tolerance. 

Furthermore, the benefit to the host of 
fucosylation of intestinal epithelial cells shown 
by the authors sheds light on potential trade- 
offs with the known negative consequences of 
fucosylation. These include the use of fuco- 
sylated receptors by viruses®, and the use of 
liberated fucose by bacterial pathogens’. The 
complex interactions between the host, its 
microbiota and pathogens, especially regard- 
ing fucose, will continue to be a stimulating 
area for investigation. m 
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Three new Jurassic euharamiyidan species 
reinforce early divergence of mammals 


Shundong Bi'?, Yuanging Wang", Jian Guan®, Xia Sheng* & Jin Meng? 


The phylogeny of Allotheria, including Multituberculata and Haramiyida, remains unsolved and has generated contentious 
views on the origin and earliest evolution of mammals. Here we report three new species of a new clade, Euharamiyida, 
based on six well-preserved fossils from the Jurassic period of China. These fossils reveal many craniodental and post- 
cranial features of euharamiyidans and clarify several ambiguous structures that are currently the topic of debate. Our 
phylogenetic analyses recognize Euharamiyida as the sister group of Multituberculata, and place Allotheria within the 
Mammalia. The phylogeny suggests that allotherian mammals evolved from a Late Triassic (approximately 208 million 
years ago) Haramiyavia-like ancestor and diversified into euharamiyidans and multituberculates with a cosmopolitan 
distribution, implying homologous acquisition of many craniodental and postcranial features in the two groups. Our 
findings also favour a Late Triassic origin of mammals in Laurasia and two independent detachment events of the middle 


ear bones during mammalian evolution. 


Mammalia Linnaeus, 1758 
Allotheria Marsh, 1880 
Euharamiyida (new clade) 

Shenshou lui gen. et sp. nov. Bi, Wang, Guan, Sheng and Meng 
Etymology. Shen, from pinyin of the Chinese word, meaning deity, divin- 
ity or cleaver; shou, from pinyin of the Chinese word for creature, animal 
or beast; specific name after Lu Jianhua, the collector of the holotype. 
Holotype. A nearly complete skeleton from an adult individual (LDN 
HMEF2001, Lande Museum of Natural History, Tangshan, Hebei Prov- 
ince, China) (Figs la and 2d, and Extended Data Figs 1a and 2). Three 
specimens are referred to as the paratypes (see Supplementary Informa- 
tion, section A). 

Paratypes. Three specimens are referred to as the paratypes (see sec- 
tion A of Supplementary Information). 

Locality and horizon. The Tiaojishan Formation, Daxishan site of 
Linglongta, Jianchang County, Liaoning Province, China; the locality 
was dated as approximately 160 million years ago (within the Oxfordian) 
(see Supplementary Information, section C, for age constraint). 
Diagnosis. Medium-sized euharamiyidan with an estimated body mass 
of 300 g (Supplementary Information, section D). Dental formula I’- 
C°-P?-M?/I,-Co-P1-M> (I, incisor; C, canine; P, premolar; M, molar; 
superscript, upper teeth; subscript, lower teeth). The only pair of upper 
incisors (I*) are in contact so that a facet is present on the medial side of 
each tooth; I’ with two cusps. The mesial upper premolar (P*) small (not 
basined) and the ultimate premolar (P*) not significantly larger than 
upper molars. Upper molars with two main cusps in the buccal row, 
separated by a low ridge; three cusps in the lingual one, of which the pen- 
ultimate cusp (B2) is the largest. P, sub-molariform with a main mesio- 
lingual cusp (a1) anda long basined heel with two rows of cusps; M, with 
three cusps in each cusp row, M, with four cusps in the lingual and three 
in the labial row (Fig. 2d; see Supplementary Information, section A, for 
differential comparisons). The terminology we use to designate cusps 
in allotherian teeth is as follows: for upper teeth, the buccal (labial or 
lateral) cusp row is A, lingual row is B, cusps are numbered from the 


distal end; lower teeth, the lingual cusp rowis a, the buccal row is b (lower 
case), numbering starts from the mesial (anterior) end. 
Eleutherodontidae Kermack et al., 1998 

Xianshou gen. nov. Wang, Meng, Bi, Guan and Sheng 
Etymology. Xian, from pinyin of the Chinese word meaning celestial 
being or immortal. 
Locality and horizon. Same as Shenshou. 
Diagnosis. Dental formula: P-C°-P?-M?/T,-Co-P1-M; upper and lower 
molars ovoid in outline, with a shallow central basin. Differ from Sine- 
leutherus in having three well-separated cusps of I’, non-molariform P, 
with a hypertrophic mesiolingual cusp (a1) and a weakly basined heel. 
Differ from Eleutherodon in the ovoid upper molars by absence of third 
row of cusps, and lack of cuspules and transverse ridges in the central 
basin. Differ from Shenshou and Arboroharamiya in having an extra- 
small I’, ovoid upper and lower molars, more distally positioned disto- 
buccal cusp (A1) on P*and M’;a hypertrophic mesiolingual cusp (a1) 
on lower molars (Fig. 2e, f; see Supplementary Information, section A, 
for differential comparisons, and Extended Data Figs 5 and 6 for addi- 
tional figures). 

Xianshou linglong sp. nov. Wang, Meng, Bi, Guan and Sheng 

Etymology. linglong, from pinyin of the Chinese word, meaning ‘exquisite’, 
and also after the town name Linglongta where the specimen came from. 
Holotype. A skeleton preserved on a split slab of laminated siltstone 
(IVPP V16707A-B, Institute of Vertebrate Paleontology and Paleoan- 
thropology, Chinese Academy of Sciences, Beijing, China; Figs 1b and 
2e, and Extended Data Figs 1b and 5). 
Diagnosis. Estimated body mass of 83 g. I’ with three cusps; I’ extre- 
mely small and budlike. Upper molars with sharp cusps and stronger 
and sharper ridges (flutings) than those in Shenshou and X. songae; M' 
with two main cusps at the mesial and distal end and connected by alow 
and sharp ridge in each cusp row; an enlarged and more distally extended 
distobuccal cusp (A1) on P* and M'; Py enlarged with a hypertrophic 
mesiolingual cusp (a1) and a small talonid heel; cusp al of lower molar 
procumbent, projecting mesially beyond the crown. 


lKey Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, 
China. @Department of Biology, Indiana University of Pennsylvania, Indiana, Pennsylvania 15705, USA. Beijing Natural History Museum, 126 Tianqiao Street, Dongcheng District, Beijing 100050, China. 
*Paleontological Museum of Liaoning, Shenyang Normal University, Shenyang, Liaoning 110034, China. Division of Paleontology, American Museum of Natural History, Central Park West at 79th Street, 


New York, New York 10024, USA. 
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Figure 1 | The holotypes of three euharamiyidan species. a, Holotype 
(LDNHME2001) of Shenshou lui. b, Holotype (IVPP V16707A) of Xianshou 
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Xianshou songae sp. nov. Meng, Guan, Wang, Bi and Sheng 
Etymology. The specific name is after Rufeng Song, the collector of the 
holotype specimen. 

Holotype. A skeleton preserved partial skull, mandible, and most of 
the postcranial skeleton (BMNHC-PM003253, Beijing Natural History 
Museum, China) (Figs 1c and 2f, and Extended Data Fig. 6). 
Diagnosis. A small euharamiyidan with an estimated body mass of 
40 g. Differ from X. linglong by having significantly smaller body size 
and in having P* more transversely oval, cusp Al on P* and M’ pro- 
portionally smaller and less distally extended. The lingual row of M' 
bears three cusps, of which the middle (B2) is the largest; lower molars 
anteroposteriorly short with a vertical hypertrophic mesiolingual cusp 
(al) and low buccal cusps. 
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linglong. c, Holotype specimen (BMNHC-PM003253) of Xianshou songae. See 
Extended Data Fig. 1 for interpretations of the skeletal features. 


Craniodental features 


The general skull shape of the new euharamiyidans is therian-like with 
a broad basicranial region and a tapered rostrum in dorsal and ventral 
views (Extended Data Figs 3 and 4) but multituberculate-like in lateral 
view (Fig. 2a, b). A small septomaxilla is probably present (Extended Data 
Fig. 2a). The zygomatic arch is slender but fully developed, with the ante- 
rior root lateral to P*. As reflected by the skull shape, the cranial cavity 
is more inflated than those of eutriconodontans. The glenoid fossa is 
anterolateral to the bulging promontorium of the petrosal and antero- 
posteriorly oriented, and lacks the postglenoid process (Extended Data 
Figs 4b and 5b). The dentary is similar to that of Arboroharamiya’ and 
multituberculates in having a distinct diastema between the incisor and 
premolar, a small coronoid process, a masseteric fossa extending ante- 
riorly to the level of P4 and a low-positioned mandibular condyle that is 


Figure 2 | Teeth, skull and mandibles of euharamiyidans. a, Reconstruction 
of the skull and jaws of Xianshou linglong (the small I’ is on the medial side 
of the large I’). b, The labial view of the lower jaw of Xianshou linglong. 

c, Lingual view of the lower jaw of Xianshou linglong. Part of the ascending 
ramus and coronoid bone have not been preserved and are reconstructed based 
on Shenshou lui. d, Right LP, M'”, and left Py-M, of Shenshou lui in occlusal 
view. e, Left I’, I?, P4,M! and P,-M, of Xianshou linglong in occlusal view. 
Dashed lines represent the alveolus of P*f, Right P°-M7’, and P, and M, of 
Xianshou songae in occlusal view. See text for description of the skull and 
mandible. Dental formula are I'’-C°-P?-M?/I,-Cg-P3-Mb (the upper incisor is 
not preserved in this species). Shenshou and Xianshou have a multicusped I? 
and a greatly enlarged 1, that is fully covered with enamel. There is no upper 
or lower canine. There are one lower and two upper premolars, and two upper 
and lower molars. Small enamel ridges (flutings) vary in size and density on 
cheek tooth cusps and basins. Upper premolars are broadly basined with main 
cusps located peripherally. Lower premolar has an enlarged al, which is 
hypertrophic in Xianshou and Arboroharamiya', but there is no serration. Cusp 
Al (distobuccal) of the upper premolar and molars is the largest and extends 
distally. Cusp al (mesiolingual) of the lower molars is the largest cusp, 
extending mesially. The lingual cusps of lower molars bear wear on their lingual 
sides, indicating that they bit into the central valley of the upper molars in 
mastication. Molars have a fusiform (spindle-shaped) basin that is closed 
mesially by a ridge in the upper molars and distally in the lower molars. The 
ridge can be erased owing to wear so that the central valley becomes confluent 
in Shenshou. Scale bars: a-c, 10 mm; d-f, 2 mm. Cusp terminology follows 
ref. 12. See photographs of teeth in Extended Data Figs 2, 4b, c, 5, 6 and 7a-c. 
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orientated ventrodorsally. A vestigial coronoid is present on the medial 
side of the jaw (Extended Data Fig. 2b), posteroventral to M2, similar to 
that of Arboroharamiya and the Jurassic multituberculate Kuehneodon’. 
A reduced coronoid, usually indicated by a scar on the dentary, is also 
present in some more advanced mammals, such as Amphi therium’ and 
Zhangheotherium*. This is in contrast to sizable coronoids that cover the 
anterior portion of the postdentary unit in primitive mammaliaforms, 
such as Morganucodon* and Haldanodon’. Unlike multituberculates 
but similar to Arboroharamiya, the dentary has a small angular process 
that inflects medially. 

Dentitions are preserved in situ in the six specimens (Figs 1 and 2, 
and Extended Data Figs 1-7), confirming the tooth identification and 
orientation of Arboroharamiya’ as well as reinterpretation’ of some pre- 
viously known euharamiyidans’""°. The cusp arrangement and mor- 
phology from Haramiyavia, Thomasia, Arboroharamiya and the new 
species reported here show consistently larger row A cusps (buccal) than 
row B cusps in upper molars, and larger row a cusps (lingual) than row 
b cusps in lower molars. In the lower cheek teeth, the mesiolingual cusp 
(a1) is the largest, whereas the distobuccal one (A1) is the largest in the 
upper teeth. The lingual lower cusp row bites into the central valley of the 
upper molars as indicated by wear facets in the upper and lower teeth 
(Extended Data Fig. 7a—c). These dental features and occlusal patterns 
should also be applicable to those known only from isolated teeth, such 
as Eleutherodon and Sineleutherus. It is also noteworthy that similar 
tooth cusp morphology and occlusal patterns are present in the upper 
and lower second molars of some Jurassic multituberculates"’, in which 
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the mesiolingual cusps are the tallest among cusps in the lower molar. 
The molars with two rows of multiple cusps and their wear patterns, 
coupled with the morphologies of the lower jaw and the glenoid fossa, 
convincingly indicate palinal jaw movement during mastication”. 


Postcranial skeleton 


Although the vertebral column varies considerably in extant mam- 
mals, particularly those from the southern continents’, the general or 
primitive cervical and thoracolumbar counts for therian mammals are 
conservative’. The axial skeleton of the new euharamiyidans is therian- 
like in possessing 7 cervical and 19 thoracolumbar vertebrae (13 thoracic 
and 6 lumbar; Fig. 3). In contrast to the docodontan Castorocauda", the 
eutriconodont Liaoconodon and Yanoconodon'*”’, and the symmetro- 
dont Akidolestes'*, which all have lumbar ribs, euharamiyidans (includ- 
ing Arboroharamiya) have true lumbar vertebrae that lack ribs but have 
obliquely oriented facets of the prezygapophysis, elaboration of the trans- 
verse process, and the laminapophysis (a mammalian neomorph that 
split into the anterior metapophysis and posterior anapophysis)’*(Fig. 3b). 
These features define a distinct thoracolumbar boundary (Fig. 3a) and, 
along with fixation of cervical vertebrae” and well-developed ribcage, 
are indicative of development of the diaphragm, a unique feature that 
allows mammals to progressively increase ventilation in adapting to 
fast movement’’”®. 


Figure 3 | Skeletal features of euharamiyidans. a, Reconstruction of the 
skeleton of Shenshou lui, based in part on Xianshou linglong and Xianshou 
songae. b, The vertebral column of Shenshou lui. c, The anterior section of 
the rib cage in Shenshou lui. d, The right scapular of Shenshou lui. e, The 
distal end of the humerus in Shenshou lui. f, Reconstruction of the hand of 
Shenshou lui, based on LDNHMF2001 and WGMV-001. g, Reconstruction of 
the foot of euharamiyidans, based on Shenshou lui (JZT-D061) and Xianshou 
songae. The cervical ribs, present in monotremes and many Mesozoic 
mammals*!””°*, are absent. Similar to therians, such as Eomaia®’ and 
Sinodelphys”’, the first ten thoracic ribs are connected to the ossified 
manubrium and sterna via costal cartilages and the last three ribs float freely. 
The capitulum of each rib articulates between two thoracic centra at a 
ventral position. The last thoracic rib (13th) is identified as the diaphragmatic 
vertebra because the articular facets of its zygapophyses with the first 

lumbar are more vertical than horizontal, as in many extant mammals”. 

The anticlinal vertebrate is located at lumbar four. The tail consists of 18-22 
caudal vertebrae, with the proximal ones bearing expanded transverse 
processes as in extant prehensile therians”’. The scapular blade is roughly 
rectangular and has a large and deep infraspinous fossa but lacks a 
supraspinous fossa. The clavicle is strap-shaped and slightly curved anteriorly. 
Its rough proximal end indicates a flexible contact with the depression on 
the clover-leaf-shaped interclavicle. The distal end of the humerus has a 
bulbous radial condyle, a relatively smaller spherical ulna condyle, and a 
prominent entepicondyle. The femur has a hemispherical head with an 
extensive articular surface but a short neck, the greater trochanter is triangular 
and does not extend above the head. The distal condyles are small with a 
shallow patellar groove. The ankles are comparable to those of 
multituberculates, such as Kryptobaatar”, in that the calcaneus has a short 
and mediolaterally compressed tuber calcanei and are partly superposed by 
the astragalus. The cuboid is positioned obliquely and has little contact with 
the calcaneus so that metatarsal V is separated from the calcaneus. The 
entocuneiform is elongated, and its joint with metatarsal I is offset anteriorly 
from the joint of the intermediate cuneiform and metatarsal II, similar to that 
of Asioryctes and Eomaia**. Metapodials and proximal phalanges possess a 
well-developed palmar groove for the digital flexor muscle tendon. Metatarsals 
I and V are short and robust compared to others. The distal ends of proximal 
and intermediate phalanges are well trochleated. Terminal phalanges are 
compressed laterally and curved strongly with a sharp tip; each has a massive 
digital flexor tubercle on the ventral side and a small dorsally extended ridge. 
as, astragalus; ce, centrale; cm, calcaneum; ct, capitate; cu, cuboid; ec, 
ectocuneiform; en, entocuneiform; hm, hamate; L, lumbar; Itp, lumbar 
transverse process; lu, lunate; me, mesocuneiform; T, thoracic; td, trapezoid; 
tm, trapezium; sp, scaphoid; pi, pisiform; tq, triquetrum; ts, tarsal spur. The 
10mm scale applies to panel a and the 5 mm scale bar applies to all other 
postcranial elements. See photographs of skeletons and postcranial structures 
in Extended Data Figs 1, 3, 4a, 5a and 7d-g. 
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The scapula (Fig. 3d and Extended Data Fig. 7g) is primitively similar 
to that of Megazostrodon, Haldanodon and monotremes*’” in that it 
lacks the supraspinous fossa, but differs from those taxa in that it has a 
reduced coracoid that is fused to the scapula and forms a small part of 
the glenoid fossa that faces ventrally. The clavicle (Fig. 3c) is similar to 
that of multituberculates”’. As in multituberculates the humerus has a 
slight shaft torsion (approximately 15°)”, in contrast to a strong prox- 
imodistal torsion in cynodonts and other early mammals*™. The distal 
end of the humerus is highly comparable to that of multituberculates*** 
(Fig. 3e and Extended Data Fig. 7e). In the wrist, the hamate is hyper- 
trophied and the scaphoid and triquetrum are enlarged, as in some extant 
arboreal marsupials””® and the scansorial Sinodelphys’’. Metacarpal V 
is offset from the hamate and the pisiform is sizable, similar to those of 
Zhangheotherium* (Fig. 3g and Extended Data Fig. 8a). 

The epipubic bone, a plesiomorphic feature common in several groups 
of Mesozoic mammals, including eutherians”*, is absent. The pelvis is 
similar to that of therians in being shallow and differs from the deep 
pelvis of multituberculates and monotremes in having a reduced pubis 
and an ischium with a straight dorsal margin and an ischial tuberosity 
that is slender and less extended dorsoposteriorly™*. The ilium is shorter 
than that of multituberculates. The femur has a short neck, similar to 
eutriconodonts'*”*”’, symmetrodonts*!**' and more primitive forms*"””, 
but different from multituberculates, in which the femoral head has 
a long neck and the greater trochanter projects beyond the head”*. 
There is no parafibula, differing from Eomaia and Jeholodens****. The 
proximal end of the tibia is roughly symmetrical, contrasting to that of 
multituberculates**, The bony extratarsal spur is distinctive and displays 
different relative sizes in different species reported here. The spur or its 
os calcaris exists in monotremes* and is common in Mesozoic mam- 
mals and their close relatives'***. As in Arboroharamiya and some extant 
arboreal didelphids and cheirogaleid primates’, the manus and pes are 
characterized by relatively short metapodials and long phalanges (Fig. 3f, g 
and Extended Data Fig. 8). The limb features are collectively indicative 
of scansorial and/or arboreal adaptation’”*”. 


Phylogenetic relationship 


A major unsolved problem in mammalian taxonomy and phylogeny 
concerns Allotheria (Multituberculata and Haramiyida)*’***°*?, which 
affects how we view the early evolution of mammals. Most recent studies 
present contrasting hypotheses that either place allotherians in mammals’, 
indicating an explosive model for the origin of mammals in the Late 
Triassic (approximately 208 million years ago) and a long-fuse model 
for the origin of therians, or separate them from multituberculates and 
place them outside mammals”, suggesting an explosive model for the 
origin of the Mammalia in the Middle Jurassic epoch. In light of the 
new data reported here, we are able to revise characters used in previous 
studies’, discuss existing problems relating to Megaconus (Supplemen- 
tary Information, section E), the postdentary trough in ‘haramiyidans’ 
(now demonstrated to be a paraphyletic group) (Supplementary Infor- 
mation, section F) and Hadrocodium (Supplementary Information, sec- 
tion G) and conduct phylogenetic analyses including the new species 
(Supplementary Information, sections H-J). The result shown in Fig. 4 
(see also Extended Data Fig. 9) is consistent with most recent phyloge- 
netic analyses'"'"*"’, which suggest that the ‘haramiyidans’ are related 
to multituberculates to form Allotheria within the Mammalia. This sup- 
ports a Late Triassic origin of mammals’. This hypothesis gains support 
from some Late Triassic ‘symmetrodontans’, such as Kuehneotherium 
and Woutersia, that are considered taxonomically to be members of the 
trechnotherian mammals******, Our analyses also recognize a new clade, 
here named Euharamiyida, that pairs with Multituberculata; this is consis- 
tent with the view that previously discovered ‘haramiyidans’ seem to form 
a paraphyletic group, from which multituberculates were derived’***”. 
Primitive species traditionally placed in ‘haramiyidans’, such as Hara- 
miyavia and Thomasia, form the stem members of allotherians. This 
resultant topology remains the same whether Megaconus was included 
(Extended Data Fig. 10a) or both Megaconus and Hadrocodium were 
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Figure 4 | Phylogeny of mammals with focus on Allotheria. This simplified 
cladogram is based on the consensus tree (Extended Data Fig. 9) computed 
from a data matrix with 113 taxa and 495 characters (see Supplementary 
Information). The dashed line of ‘symmetrodontans’ indicates occurrences 
of taxa that existed in the Late Triassic, such as Kuehneotherium and 
Woutersia*, but are not included in our phylogenetic analyses because of 
their fragmentary preservation. Tree nodes represent the following clades: 
(1) Mammaliaformes; (2) Mammalia; (3) unnamed clade consisting of 
Eutriconodonta, Allotheria and Trechnotheria**; (4) Allotheria; (5) 
Trechnotheria; and (6) Cladotheria. Mammaliaformes, Crown Mammalia, 
Allotheria, Multituberculata and Euharamiyida are also marked as nested 
colour blocks from the most to the least inclusive group. 


excluded (Extended Data Fig. 10b) (see discussions in Supplementary 
Information, sections E and G). 


Character evolution of early mammals 


With the discoveries of the new euharamiyidans, it becomes increas- 
ingly evident that the cranial and postcranial features of euharamiyidans 
and multituberculates are similar to each other and to other mammals. 
However, the fundamental obstacle in interpreting their mammalian 
affinity remains the fact that the tooth pattern consists of two main rows 
of multiple cusps that are capable of longitudinal (palinal) chewing func- 
tion in allotherians'***”’. If allotherians were placed outside mammals, it 
is equally difficult to derive the allotherian tooth pattern from other mam- 
maliaformes, such as tritylodontids. Our phylogenetic analyses (Fig. 4) 
suggest that the primitive allotherian tooth pattern, as represented by 
Haramiyavia, was probably derived by developing an extra cusp row, 
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or rows, from a triconodont-like tooth pattern or even from a tooth pattern 
with an initially reversed triangular cusp arrangement. It was noted that 
in lateral view the teeth of Haramiyavia are more similar to those of 
Sinoconodon and Morganucodon than to those of multituberculates*. 
We note, however, that the tooth pattern of Haramiyavia is also similar 
to that of Woutersia, which co-existed with Theroteinus, another Late 
Triassic haramiyidan**”’. The tooth morphology and occlusion of euha- 
ramiyidans indicate that, if the allotherian tooth pattern was derived 
from a triconodont tooth pattern, the secondary cusp row has to be added 
on the buccal side in the lower teeth. In the conventional view, how- 
ever, development of extra cusps on the lingual cingula is common, but 
buccal cingula are rare on lower molars*°, although exceptions exist, such 
as Hallautherium*’. Nonetheless, the orientation of an isolated tooth 
in early mammals is not always certain, as demonstrated in the case of 
eleutherodontids’ (this study). There is no convincing evidence to rule 
out the possibility that additional cusps could be added on the buccal 
side of the tooth in early mammals. Better material with teeth in situ 
from each taxon of interest, such as Woutersia, is needed to test this 
hypothesis. 

Interpretations of character evolution in early mammals depends 
on their phylogeny. If *haramiyidans’ were separated from multituber- 
culates and placed outside mammals, while multituberculates fell within 
mammals”, then numerous similar craniodental and postcranial fea- 
tures, particularly the molar pattern with two cusp rows and bilateral 
occlusion, must have evolved independently in ‘haramiyidans’ and mul- 
tituberculates during different periods of time. In addition, detachment 
of the postdentary bones from the dentary would have evolved at least 
four times independently in ‘haramiyidans’, multitubuculates, mono- 
tremes and therians. However, our phylogeny (Fig. 4 and Extended Data 
Figs 9 and 10) indicates that Euharamiyida and Multituberculata were 
probably derived from a Haramiyavia-like common ancestor at a min- 
imum oldest age (according to current fossil records; future finds may 
reveal an earlier ancestor) in the Late Triassic and diversified thereafter 
during the Jurassic epoch, with known euharamiyidans adapting to a 
scansorial and/or arboreal lifestyle which may explain their rare fossil 
record. In contrast to interpreting numerous parallelisms in ‘harami- 
yidans’ and multituberculates”, our hypothesis favours homologous 
acquisition of many similar craniodental and postcranial features in 
euharamiyidans and multituberculates, such as reduction of teeth, enlarge- 
ment of the lower incisors, possessing only two molars in each side of the 
upper and lower jaws, and a palinal chewing motion. Moreover, euhar- 
amiyidans are similar to multituberculates in lacking the postdentary 
trough and Meckelian groove, indicating the presence of the definitive 
mammalian middle ear’. If the reinterpretation is correct—that the den- 
tary of Haramiyavia has only the Meckelian groove (see Supplemen- 
tary Information, section F)—then the clade containing Eutricondonta, 
Allotheria and Trechnotheria’’ (Fig. 4 and Extended Data Fig. 9) would 
have evolved from a common ancestor that had a transitional mamma- 
lian middle ear’®. This clade and the geological and geographic occur- 
rences of its earliest known members are consistent with accumulating 
evidence from Gondwana landmasses that shows a cosmopolitan distri- 
bution of members in the clade* and suggest a Laurasian origin of mam- 
mals. Finally, by reinterpretating Hadrocodium as having postdentary 
bones (see Supplementary Information, section G), our phylogeny sug- 
gests that detachment of the postdentary bones evolved twice indepen- 
dently during the early evolution of mammals, once in the clade leading 
to monotremes and once towards the clade containing Eutricondonta, 
Allotheria and Trechnotheria. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Extended Data Figure 1 | The holotypes of three euharamiyidan species. 
a, Holotype specimen (LDNHMEF2001) and line drawing of Shenshou lui. 
b, Holotype specimen (IVPP V16707A) and line drawing of Xianshou linglong. 
c, Holotype specimen (BMNHC-PM003253) and line drawing of Xianshou 
songae. as, astragalus; c, cervical vertebrate; ca, caudal vertebrate; cc, costal 
cartilage; cm, calcaneum; ct, capitate; cu, cuboid; en, entocuneiform; fr, frontal; 
hm, hamate; ic, interclavicle; ip, intermediate phalanges; 1, lumbar vertebrae; Ic, 
left clavicle; Ife, left femur; Ifi, left fibula; lh, left humerus; li, left ilium; lis, left 
ischium; lm, left mandible; Ira, left radius; Isc, left scapular; Ipb, left pubis; Iti, left 


ARTICLE 


tibia; lu, lunate; lul, left ulna; m, maxilla; mb, manubrium; mc, metacarpal; mt, 
metatarsal; n, nasal; pa, parietal; pi, pisiform; pm, premaxilla; pp, proximal 
phalanges; r, rib; rc, right clavicle; rfe, right femur; rfi, right fibula; rh, right 
humerus; ri, right ilium; ris, right ischium; rm, right mandible; rra, right radius; 
rsc, right scapular; rti, right tibia; ru, right ulna; s, sacral vertebrae; sm, 
septomaxilla; sp, scaphoid; sq, squamosal; stb, sternebra; t, thoracic vertebrae; 
td, trapezoid; tm, trapezium; tp, terminal phalanges; tq, triquetrum; tr, thoracic 
ribs; ts, tarsal spur. 
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Extended Data Figure 2 | Close-up views of craniodental structure of 
Shenshou lui (LDNHMEF2001). a, The skull and mandibles of Shenshou lui 
(LDNHME2001). ap, angular process; m, maxilla; maf, masseteric fossa; n, 
nasal; pal, palatine; pm, premaxilla; ptf, pterygoid fossa; sm, septomaxilla; 

sq, squamosal. b, The lingual surface of the mandible of Shenshou lui 
(LDNHME2001). The close-up view shows the vestige of the coronoid bone at 
the ventrodistal side of the ultimate lower molar and the wear facets on the 
lingual sides of the lingual cusps of M, 2. c, The rostrum of the skull of Shenshou 
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lui (LDNHMEF2001). The figure is a close-up view of the rostrum region in a. It 
shows the relationship of the alveoli for P*“ with the molars and the wear facets 
on the medial side of left I?. d, Photographs and stippling drawings of the 
dentition of Shenshou ui (LDNHME2001). The dentitions were coated with 
ammonium chloride to enhance contrast. The holotype of Shenshou is an old 
individual, judging from the deeper wear on its teeth. The mesial end of the 
upper molar basin and the distal end of the lower molar basin appear to be 
open, not closed by ridge or cusps, but this may be partly due to the tooth wear. 
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Extended Data Figure 3 | Paratypes 1 and 2 of Shenshou lui. a, Dorsal view of the skeleton of Shenshou lui (paratype 1, WGMV-001). b, The split skeleton of 
Shenshou lui (paratype 2, JZT-CKO05A, B). 
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Extended Data Figure 4 | Paratype 3 and skulls of paratypes 1 and 2 of Note the bulg romontorium of the petrosal, the inflected angular process 
Shenshou lui. a, Dorsal view of the skeleton of Shenshou lui (paratype 3,JZT- and small right P”. c, Lateral view of the skull and mandible of Shenshou lui 
D061). b, Ventral view of the skull of Shenshou lui (paratype 1, WGMV-001). (paratype 2, JZT-CK005). 
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Extended Data Figure 5 | Holotype and close-up views of craniodental 
features of Xianshou linglong (IVPP V16707B). a, Counterpart of the 
holotype specimen of Xianshou linglong (IVPP V16707B). ca, caudal 
vertebrate; ip, intermediate phalanges; Ifi, left fibular; lh, left humerus; Ira, left 
radius; Iti, left tibia; lu, left ulna; mt, metatarsal; pp, proximal phalanges; r, rib; 
rfe, right femur; rfi, right fibular; tp, terminal phalanges. b, c, Skull (two sides) 
and mandibles of Xianshou linglong (IVPP V16707A). b, the skull (ventral 
view) and left mandible (medial view) and right mandible (labial view). ¢, the 
skull (dorsal view) and left mandible (later view); ap, angular process; fr, frontal; 
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glf, glenoid fossa; hy, hyoid; ima, internal acoustic meatus; m, maxilla; mac, 
mandibular condyle; maf, mandibular foramen; mtf, masseteric fossa; n, nasal; 
pa, parietal; pm, premaxilla; sm, symphysis of mandible (left); vpt, ventral ridge 
of pterygoid fossa; za, zygomatic arch. d, Photographs and stippling drawings of 
the dentition of Xianshou linglong (IVPP V16707). P” is broken; its crown was 
left in the matrix so that its outline can be reconstructed in the drawing. The 
lingual side of P, and labial side of M,which are exposed or broken are line- 
hatched. The dentitions were coated with ammonium chloride to enhance 
contrast. e, Incisors of Xianshou linglong (IVPP V16707A). 
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Extended Data Figure 6 | Close-up views of craniodental features of 
Xianshou songae (BMNHC-PM003253). a, Photographs and stippling 
drawing of the dentition of Xianshou songae (BMNHC-PM003253). The 
distolingual corner of Mis not exposed, indicated by hatched lines. Py and M, 
are also shown flipped horizontally and the lingual sides of P4 and M, are not 
exposed, indicated by hatched lines. The dentition was coated with ammonium 


chloride to enhance contrast. b, The skull and mandibles of Xianshou songae 
(BMNHC-PM003253). Note that the right upper and lower dentition are 
preserved in occlusion, showing that lingual cusp A, of M; bites into the valley 
of upper Mp). amf, anterior border of the masseteric fossa; mac, mandibular 
condyle; maf, mandibular foramen; ptf, pterygoid fossa. c, Teeth in situ of 
Xianshou songae (BMNHC-PM003253). 
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Extended Data Figure 7 | Wear facets and skeletal features of 
euharamiyidans. a, Left lower dentition of Shenshou lui (LDNHMF2001) in 
lingual view. The image shows the wear facets on lingual sides of the lingual 
cusps. b, Left Py-M, of Shenshou lui (LDNHMF2001), coated with ammonium 
chloride to enhance contrast, in roughly occlusal view. c, Wear facets in the 
basins of the P** and on the buccal sides of P?-M!. These wear facets indicate 
that A, of Py must have occluded into the P** basins or moved from P* to P* 
basins. The wear facets also indicate that the lingual cusp rows of the lower 
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molars bite into the central valley of the upper molars so that the lingual and 
buccal sides of the lingual lower cusps and the lingual side of the lower buccal 
cusps bear wear facets. Similarly, the buccal sides of the buccal cusps of upper 
molars have wear facets. d, Vertebral column in Shenshou lui. L, lumbard; Itp, 
lumbar transverse process; T, thoracic. e, The distal end of the humerus in 
Shenshou lui. f, The anterior section of the rib cage in Shenshou lui. g, the right 
scapula of Shenshou lui. 
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Extended Data Figure 8 | Euharamiyidan manus and pes, and their ternary 
plots. a, The manus of Shenshou lui (LDNHMF2001). b, The pes of Shenshou 
lui (JZT-D061). as, astragalus; cm, calcaneum; ct, capitate; cu, cuboid; en, 
entocuneiform; ip, intermediate phalanges; mc, metacarpal; mt, metatarsal; pi, 
pisiform; pp, proximal phalanges; sp, scaphoid; td, trapezoid; tm, trapezium; 
tp, terminal phalanges; tq, triquetrum; ts, tarsal spur. c, Ternary diagrams 
showing intrinsic manual and pedal ray III proportions. Ternary plots showing 
relative metapodial, proximal and intermediate phalangeal lengths for the third 
digit ray of the hand and foot. The lengths of the third metapodial, proximal 
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phalanx and intermediate phalanx are shown on their respective axes as a 
percentage of the combined length of the three segments. Compared to both 
fossil and extant taxa, euharamiyidans have the intrinsic manual and pedal ray 
proportions typical of arboreal species in which the proximal and intermediate 
phalanges are long relative to the metapodials. Ar, Arboroharamiya jenkinsi; 
Eo, Eomaia scansoria; Je, Jeholodens jenkinsi; Ma, Maotherium sinensis; Sb, 
Sinobaatar lingyuanensis; Sd, Sinodelphys szalayi; Sh, Shenshou; X1, Xianshou 
linglong; Xs, Xianshou songae. 
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Chaetophractus 
Didelphodon 
Pediomys 
Turgidodon 
Mayulestes 
Pucadelphys 
Andinodelphys 
Didelphis 
Marmosa 
Caenolestes 
Dasyurus 
Petauroides 
Acrobates 
Kokopellia 
Gobiconodon 
Repenomamus 
Amphilestes 
Yanoconodon 
Jeholodens 
Trioracodon 
Hadrocodium 


Extended Data Figure 9 | The strict consensus tree of 198 equally most 
parsimonious trees. Tree length = 2,509; consistency index (CI) = 0.3304; 
homoplasy index (HI) = 0.6696; retention index (RI) = 0.7901. The simplified 
version of this consensus tree is presented in Fig. 4. Note that Tinodon was 
placed as the sister taxon of Allotheria in this cladogram, which is inconsistent 
with the taxonomic position of the genus**. This tree topology and that in 
Extended Data Fig. 10b show the phylogenetic instability of Tinodon, as in 
other studies'''®!””°, that may be attributed to a significant amount of missing 


data. However, the current phylogenetic position of Tinodon may reflect the 
possibility that the allotherian tooth pattern, as represented by Haramiyavia, 
was derived by developing an extra cusp row from a triconodont-like tooth 
pattern or even from a tooth pattern with an initially reversed triangular cusp 
arrangement, as discussed in the main text. Better material is needed to 
secure the phylogenetic position of Tinodon and to test our hypothesis. See 
Supplementary Information, section H, for methods as well as tree 1 and 
related data. 
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2,448; CI = 0.3386; 


E, for discussion. b, The strict consensus tree of 101 equally most parsimonious 


trees without Megaconus and Hadrocodium. Tree length 


Extended Data Figure 10 | Strict consensus trees with Megaconus and 
without both Megaconus and Hadrocodium. a, The strict consensus tree 


HI = 0.6614; RI = 0.7952. See Supplementary Information, section G, for 


of 133 equally most parsimonious trees (including Megaconus). Tree 


length 


discussion. Note that Tinodon is clustered with other ‘symmetrodonts’ plus the 
clade leading to therians, and that eutriconodonts are split into two groups. See 


Supplementary Information, sections E and G, for discussion. 


2,528; CI = 0.3283; HI = 0.6717; RI = 0.7910. Note that Megaconus is 


separated from allotherians and placed outside the crown mammals. This 


suggests that the placement of Megaconus in allotherians*’ needs to be tested 


when new material becomes available. See Supplementary Information, section 
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Mesenchymal-endothelial transition 
contributes to cardiac neovascularization 


. . : 2 . Ps . . . 
Eric Ubil', Jinzhu Duan??*>®7*, Indulekha C. L. Pillai?’+>®7, Manuel Rosa-Garrido??*?, Yong Wu, Francesca Bargiacchi’®, 
Yan Lu*3+5-67, Seta Stanbouly?+>>7, Jie Huang?+>°7, Mauricio Rojas", Thomas M. Vondriska**:”*»?, Enrico Stefani?®? 


& Arjun Deb?4>7 


Endothelial cells contribute to a subset of cardiac fibroblasts by undergoing endothelial-to-mesenchymal transition, but 
whether cardiac fibroblasts can adopt an endothelial cell fate and directly contribute to neovascularization after cardiac 
injury is not known. Here, using genetic fate map techniques, we demonstrate that cardiac fibroblasts rapidly adopt 
an endothelial-cell-like phenotype after acute ischaemic cardiac injury. Fibroblast-derived endothelial cells exhibit 
anatomical and functional characteristics of native endothelial cells. We show that the transcription factor p53 regulates 
such a switch in cardiac fibroblast fate. Loss of p53 in cardiac fibroblasts severely decreases the formation of fibroblast- 
derived endothelial cells, reduces post-infarct vascular density and worsens cardiac function. Conversely, stimulation 
of the p53 pathway in cardiac fibroblasts augments mesenchymal-to-endothelial transition, enhances vascularity and 
improves cardiac function. These observations demonstrate that mesenchymal-to-endothelial transition contributes to 
neovascularization of the injured heart and represents a potential therapeutic target for enhancing cardiac repair. 


The mammalian heart after acute injury heals primarily by fibrosis. 
Cardiac fibroblasts proliferate at the site of injury’ and fibroblast pro- 
liferation is accompanied by recruitment of endothelial cells. Endothelial 
cells contribute to neovascularization of the injury region” and promote 
repair’. A close interaction between fibroblasts and endothelial cells 
is thought to regulate wound healing*. A subset of endothelial cells, by 
undergoing endothelial-mesenchymal transition, generates fibroblasts 
in the injury region’ and cardiac fibroblasts express pro-angiogenic mol- 
ecules that in turn promote angiogenesis®’. However, cardiac fibroblasts 
are thought to be terminally differentiated cells*’ and whether they have 
the ability to adopt an endothelial phenotype and directly contribute to 
neovascularization after cardiac injury is not known. Here we demon- 
strate that cardiac fibroblasts undergo mesenchymal-endothelial trans- 
ition to generate de novo endothelial cells in the injured heart and show 
that mesenchymal-—endothelial transition (or MEndoT) can be augmen- 
ted to enhance cardiac repair. 


Cardiac fibroblasts adopt endothelial-cell-like fates 


We used a genetic fate map strategy to label cardiac fibroblasts by cross- 
ing transgenic mice harbouring a tamoxifen-inducible Cre recombi- 
nase driven by fibroblast-specific regulatory sequence of the «2 (type 1) 
collagen gene (Colla2-CreERT)"*” with the lineage reporter strain 
(Rosa26R“7°™"°) to create Colla2-CreERT:R26R“'°™” progeny mice. 
In these mice, administration of tamoxifen results in activation of Cre 
recombinase, and cells expressing Colla2 at the time of tamoxifen admin- 
istration are irreversibly labelled by tdTomato fluorescence. We admi- 
nistered tamoxifen for 10 days to adult Colla2-CreERT:R26R‘“1°"*° 
mice. Five days after cessation of tamoxifen administration, we observed 
that approximately 55% of all non-myocyte cells exhibited tdTomato 
fluorescence and greater than 96% and 99% of tdTomato fluorescent 


cells expressed the cardiac fibroblast markers domain discoidin recep- 
tor 2 (DDR2) and vimentin (Extended Data Fig. la—c). Immunofluores- 
cent staining showed that 87 + 9% and 99 + 0.5% (mean + s.e.m.) of 
tdTomato-labelled cells expressed DDR2 and vimentin, respectively, 
supporting flow cytometry data (Extended Data Fig. 1d, e). td Tomato cells 
did not express endothelial markers VECAD and CD31 (99.9 + 0.06% 
and 99.8 + 0.02% negative, respectively; mean + s.e.m.) (Extended Data 
Fig. 1f, g), and did not express the cardiac progenitor marker c-Kit nor 
markers of smooth muscle, macrophages and lymphatics (Extended Data 
Fig. 1lh-k). Cardiac myocytes did not express Cre recombinase as prev- 
iously shown’°. Taken together these data strongly suggest that cells exhib- 
iting tdTomato fluorescence in hearts of Colla2-CreERT:R26R‘110"*"° 
mice are cardiac fibroblasts and do not express canonical markers of 
other cardiovascular cell types. 

We subjected Colla2-CreERT:R26R“T°™"° mice to ischaemia- 
reperfusion cardiac injury 5 days after cessation of tamoxifen injection. 
By day 3 post-injury, 35 + 3% (mean = s.e.m.) of labelled cardiac fibro- 
blasts in the region of injury expressed the endothelial-specific marker 
VECAD, while in sham-injured animals only rare labelled cells expressed 
VECAD (<0.3%) (Fig. la-c). Approximately 24 + 4%, 44 + 4% and 
35 + 3% (mean + s.e.m.) of labelled cardiac fibroblasts also expressed 
other endothelial markers such as endothelial nitric oxide synthase (eNOS) 
and the endothelial tight junctional proteins claudin 5 (ref. 14) and occlu- 
din", respectively (Fig. la—c). Mesenchymal-endothelial transition was 
most pronounced in the injury border zone and significantly decreased 
in regions remote from the infarct (Fig. 1c). The fraction of cardiac fibro- 
blasts expressing VECAD increased between 1 and 3 days post-injury and 
remained similar at 3, 7 and 14 days (Fig. 1d). The fraction of tdTomato- 
positive cells expressing VECAD in sham-injured animals at 3, 7 and 
14 days was 0.3 + 0.1%, 1.4+ 1.4% and 0.6 + 0.4% (mean + s.e.m., 
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Figure 1 | Cardiac fibroblasts adopt endothelial cell fates after cardiac 
injury. a, b, Hearts from Colla2-CreERT:R26R'7™° mice immunostained 
for endothelial markers (arrowheads). c, tdTomato™ fibroblasts (%) expressing 
endothelial markers (*P < 0.05 versus sham, +P < 0.05 versus injury border 
zone). d, Temporal expression of VECAD (*P < 0.005 versus sham, 

+P < 0.05 versus day 1 and 2, {P < 0.05 versus day 1). e, STED microscopy 
demonstrating tdTomato* VECAD* cell (arrow; open arrowhead shows 
tdTomato endothelial cell). f-h, DiO-stained capillary in longitudinal section 


P> 0.05, one way ANOVA), demonstrating no temporal difference in 
the fraction of tdTomato-labelled cells expressing VECAD after sham 
injury. 

As fibroblasts lie in close apposition to endothelial cells and pericytes, 
we performed super-resolution microscopy to validate our observations 
with confocal microscopy. Stimulated emission deletion (STED) micro- 
scopy is a form of super-resolution microscopy that provides an aver- 
age lateral resolution of 30-40 nm compared to confocal microscopy, 
which provides an average lateral resolution of 250 nm (ref. 15). Using 
STED we observed that tdTomato cells after injury express VECAD and 
can be distinguished from closely apposed endothelial cells not expres- 
sing tdTomato label (Fig. le). STED microscopy also demonstrated 
that tdTomato cells did not express pericyte markers NG2 or CD146 
(Extended Data Fig. 11, m). 

We next investigated whether fibroblast-derived endothelial cells 
incorporate into capillaries in the infarct border zone. We perfused 
Colla2-CreERT:R26R7°"*"° mice with a fluorescent lipophilic dye 
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Luminal area (%) covered by 
fibroblast-derived endothelium 
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(f, g; arrowheads show tdTomato* DiO* cells) or cross-section (h; cyan 
arrowheads show DiO-stained inner and outer endothelial cell membranes; 
white arrowheads show tdTomato~ endothelial cell; unfilled arrowhead shows 
tdTomato endothelial cell). Scale bar: 5 tum. i, Luminal surface area occupied 
by fibroblast-derived endothelium (*P < 0.005 versus sham, +P < 0.05 versus 
day 3). j, acLDL uptake by tdTomato* endothelium (arrowheads) (n = 3 
animals per group per time point). All graphs show mean + s.e.m.; scale bar, 
10 Lm unless stated in the legend. 


DiO that labels endothelial cell membranes and has been used to 
identify the vasculature of solid organs including the heart’®. In both 
longitudinal (Fig. 1f, g) and transverse sections of capillaries (Fig. 1h) 
in the injury border zone, we observed that endothelial cells bearing 
the fibroblast label lined the lumen of the blood vessel, but the contri- 
bution was minimal in sham-injured animals (Fig. 1f). Luminal DiO* 
tdTomato* cells expressed VECAD (Extended Data Fig. 2a), thus con- 
firming the endothelial phenotype. Luminal surface area occupied by 
fibroblast-derived endothelial cells peaked at 7 days post-injury and re- 
mained similar at 14 days (Fig. 1i). Endothelial cells are known to take 
up acetylated LDL (acLDL)’”. We systemically injected Colla2-CreERT: 
R26R‘1°™*" mice 3 days after cardiac injury with fluorescent-labelled 
acLDL and observed that 41 + 8% (mean + s.e.m.) of fibroblast-derived 
endothelial cells took up acLDL, demonstrating their functional sim- 
ilarity to native endothelial cells (Fig. 1)). 

A subset of cardiac fibroblasts after cardiac injury become myofibro- 
blasts and express o-smooth muscle actin (#SMA)'*”’. We stained for 
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aSMA and found that 3 days after injury, 11 + 3% (mean + s.e.m.) of 
labelled cardiac fibroblasts in the injured region expressed «SMA but 
very few labelled cells co-expressed “SMA and VECAD, suggesting that 
cardiac fibroblasts undergoing mesenchymal-endothelial transition are 
distinct from myofibroblasts (Extended Data Fig. 2b-f). 

We next determined whether substantial recombination in endothe- 
lial cells (that is, leaky promoter elements) could potentially confound 
our findings. Stochastic induction of Cre can occur after cardiac injury”. 
We injected Colla2-CreERT:R26R‘11°™*° mice with vehicle (corn oil) 
instead of tamoxifen and observed rare labelled cardiac fibroblasts 
(<0.07% at 3 and 7 days) after ischaemic injury, demonstrating that 
injury alone did not lead to significant Cre activation (Extended Data 
Fig. 3a-c). 

Endothelial cells are known to undergo endothelial-mesenchymal 
transition and adopta fibroblast phenotype after cardiac injury*. We inves- 
tigated whether endothelial cells undergoing endothelial-mesenchymal 
transition would exhibit tdTomato fluorescence and be mistakenly in- 
cluded in our analysis. We performed immunofluorescent staining for 
Coll and observed that approximately 6 + 1% (mean = s.e.m.) of endo- 
thelial cells after injury expressed Coll but none of these cells exhibited 
tdTomato fluorescence (Extended Data Fig. 3d). Conversely, td Tomato- 
labelled cells in the injury region that expressed VECAD did not stain 
for Coll (Extended Data Fig. 3e). Next, we crossed Colla2-CreERT: 
R26R4T™MX0 mice with Coll-GFP transgenic mice, which have GFP 
expression directly driven by the Colla1 promoter. GFP expression thus 
serves as a useful real-time reporter of Coll expression”'””. Progeny 
mice (Colla2-CreERT:R26R11°™"°:Col1-GEP) were administered ta- 
moxifen and subjected to ischaemic myocardial injury. At 3 days post- 
injury, tdTomato-labelled cells that expressed VECAD did not exhibit 
GFP fluorescence (Extended Data Fig. 4a). Approximately 3 + 1% (mean + 
s.e.m.) of VECAD-positive cells were GFP positive but with the excep- 
tion of rare cells, VECAD* GFP* cells did not exhibit tdTomato fluor- 
escence (Extended Data Fig. 4b). 

Immunofluorescent staining for Cre protein in Colla2-CreERT: 
R26R“T°™"° mice hearts after injury demonstrated that t{Tomato* 
VECAD* cells did not express Cre (Extended Data Fig. 4c-i), consis- 
tent with the earlier observation that these cells do not express type 1 
collagen. In the absence of tamoxifen, tdTomato cells expressing Colla2 
would be expected to have Cre in their cytoplasm. In hearts of Colla2- 
CreERT:R26R'41°™"°.Col1-GFP mice, tdTomato-labelled cells that 
expressed type 1 collagen (GFP positive) exhibited cytoplasmic but not 
nuclear Cre staining (Extended Data Fig. 4j-m). Taken together, these 
observations demonstrate the fidelity of the Cre driver and argue against 
a ‘leaky’ Cre as the basis for our findings. 

Next we investigated whether Colla2-expressing endothelial pro- 
genitors in the heart or bone marrow could have been labelled during 
administration of tamoxifen and subsequently generated tdTomato 
endothelial cells. We analysed the heart and bone marrow but observed 
that less than 0.5% of tdTomato cells in the heart (Extended Data Fig. 5a, b) 
or bone marrow (Extended Data Fig. 5c-f) expressed endothelial pro- 
genitor markers (CD133, CD34 and Flk1)*’. Bone-marrow-derived 
mesenchymal stem cells did not exhibit tdTomato fluorescence, with 
the exception of rare cells (Extended Data Fig. 5g). Collectively, these 
observations suggest that substantial labelling of endothelial progeni- 
tors did not occur, making them an unlikely source of tdTomato* endo- 
thelial cells. 

Finally, we subjected FSP1-Cre:R26R‘“1°™*"° mice to ischaemic car- 
diac injury to confirm mesenchymal-endothelial transition with another 
Cre driver. The FSP1-Cre driver has been used to track cardiac fibro- 
blast after in vivo reprogramming*”. By day 3 post injury, 31 + 4% and 
23 + 5% of tdTomato-labelled cells in the injury region expressed the 
endothelial markers isolectin and VECAD, respectively, compared to 
2+ 1%and4 + 1% insham-injured animals (Extended Data Fig. 6a—c), 
confirming mesenchymal-endothelial transition observations made using 
the Colla2-CreERT driver. 
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po3 mediates MEndoT ex vivo 

We next investigated the mechanisms regulating mesenchymal—endothelial 
transition and hypothesized that cellular stress after cardiac injury has 
a role in mesenchymal-endothelial transition. p53 is an important cel- 
lular stress-response gene”, modulates reprogramming” and regulates 
epithelial-mesenchymal transition”*. We observed that 37% of tdTomato- 
labelled cardiac fibroblasts expressed p53 at 3 days post injury. In contrast, 
rare labelled cells expressed p53 in the sham-injured heart (Fig. 2a, b). 
p53 expression in labelled fibroblasts peaked at 7 days after cardiac injury 
(Fig. 2c). By day 7 after injury, approximately 91 + 7% (mean + s.e.m.) of 
tdTomato-labelled cells expressing p53 co-expressed VECAD (Fig. 2d), 
demonstrating a strong association between p53 and VECAD express- 
ion in tdTomato-labelled cells. 

To determine whether p53 has a regulatory role in mesenchymal- 
endothelial transition, we first established an ex vivo model. Cells subjected 
to types of cellular stress ex vivo, such as serum deprivation, upregulate p53 
levels’””**. We subjected cardiac fibroblasts to serum starvation and ob- 
served increased p53 levels by western blotting (Extended Data Fig. 7a, b). 
We next seeded labelled cardiac fibroblasts (99% purity by flow cyto- 
metry) on matrigel (a mixture of basement membrane proteins that facil- 
itates capillary tube formation) and subjected them to serum starvation. 
In contrast to control fibroblasts (Fig. 2e, g), serum-starved fibroblasts 
formed capillary-tube-like structures, expressed VECAD (Fig. 2f) and 
took up acLDL (Fig. 2h), consistent with adoption of an endothelial- 
cell-like phenotype. 

Using gain- and loss-of-function approaches, we determined whether 
p53 affects serum-starvation-induced mesenchymal-endothelial trans- 
ition (Fig. 2i-o). Serum deprivation induced cardiac fibroblasts to form 
tubes on matrigel (Fig. 2i, j), but addition of pifithrin-«, an inhibitor of 
p53 (ref. 29), decreased tube formation by 67% (Fig. 2k, n). Pifithrin-o 
did not have an effect on serum-fed cardiac fibroblasts (Extended Data 
Fig. 7c, d). Next, we generated mice specifically deficient in fibroblast 
p53 by crossing Colla2-CreERT:R26R‘“'°™"° mice with mice having 
both p53 alleles floxed*®. Progeny mice (Colla2-CreERT:R26R'“1°™": 
p53"”") were injected with tamoxifen at 8 weeks to delete p53 and label 
Colla2-expressing p53-deficient cells (td Tomato * ). Cardiac fibroblasts 
were subsequently harvested and subjected to serum starvation. The 
absence of p53 decreased capillary tube formation on matrigel by 95% 
(Fig. 21, n). We subsequently investigated whether activation of p53 sig- 
nalling enhanced serum-deprivation-induced mesenchymal-—endothelial 
transition. Cardiac fibroblasts subjected to serum starvation were treated 
with the small molecule RITA (reactivation of p53 and induction of 
tumour apoptosis) that inhibits ubiquitin-mediated p53 degradation 
and enhances p53 signalling’’. RITA enhanced tube formation by 50% 
(Fig. 2m, n). RITA also increased tube formation in serum-fed cardiac 
fibroblasts but to a lesser degree (Extended Data Fig. 7c, d). After serum 
deprivation, quantitative real-time polymerase chain reaction (qPCR) 
demonstrated 4-20-fold increased expression of endothelial-specific 
genes but treatment with pifithrin-« or the genetic deletion of p53 in 
cardiac fibroblasts significantly blunted induction of endothelial gene 
expression (Fig. 20). Conversely, treatment of serum-starved cardiac fibro- 
blasts with RITA significantly increased expression of endothelial-specific 
genes by 3-7-fold (Fig. 20). Interestingly, overexpressing p53 in wild-type 
cardiac fibroblasts grown under serum-fed conditions did not induce 
expression of VECAD or other endothelial-specific genes, suggesting 
that enhancing p53 signalling in the absence of serum starvation is not 
sufficient to induce mesenchymal-endothelial transition. 

We next performed chromatin immunoprecipitation (ChIP) for p53 
on serum-starved cardiac fibroblasts and observed that p53 directly 
binds to promoter regions of endothelial-specific genes and the endo- 
thelial transcription factors Hoxa9 (ref. 32) and Hoxd3 (ref. 33) that are 
known to have an important role in endothelial differentiation (Fig. 2p). 
Gene expression of Hoxa9 and Hoxd3 was also increased in cardiac 
fibroblasts by 6.53 + 1.74 and 7.18 + 1.16-fold, respectively, after serum 
starvation (mean + s.e.m., P< 0.05 compared to cardiac fibroblasts in 
10% serum). Although an indirect effect of p53 cannot be excluded, 
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Figure 2 | Cardiac fibroblasts upregulate p53 after injury and p53 mediates 
mesenchymal-endothelial transition ex vivo. a, b, p53 immunostaining in 
injured hearts (arrowheads show tdTomato* p53" cells). c, Temporal p53 
expression in labelled fibroblasts (*P < 0.05 versus sham, n = 3 animals per 
time point). d, Co-expression of p53, VECAD and tdTomato (arrowheads). 
e-h, tdTomato’ VECAD* tubes (e, f) (scale bar, 250 jum) and acLDL uptake 
(g, h) after serum starvation (arrowheads, n = 4) (scale bar, 20 um). Right panel 
of h shows confocal image (xz plane) showing acLDL internalization. 


these observations suggest that p53 at least in this model initiates an 
endothelial gene expression program by directly inducing transcrip- 
tion of endothelial-specific genes. 

Using this ex vivo model, we next asked whether mesenchymal- 
endothelial transition was reversible. Transforming growth factor-B 
(TGF-B) enhances endothelial-mesenchymal transition’ and TGF-B 
added to cardiac fibroblasts at the onset of serum starvation prevented 
tube formation (Extended Data Fig. 8a, b) and induction of VECAD 
expression (0.98 + 0.03-fold change in VECAD with TGF-B, mean + 
s.e.m., P > 0.05, n = 3). Moreover, when TGF-B was added to serum- 
starved cardiac fibroblasts after they had already formed tubes, it led to 
99% regression of tube formation (Extended Data Fig. 8c, d, g). A simi- 
lar effect was observed with adding back serum (Extended Fig. 8e-g). 
VECAD expression also decreased by 56.4 + 2.4% (mean + s.e.m, P< 0.05, 
n = 3). Addition of pifithrin-« to serum-starved cardiac fibroblasts that 
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i-m, Tube formation of cardiac fibroblasts in 10% serum (i) or 0% serum with 
PBS (j), 100 1M pifithrin-« (k), 0.1 1M RITA (m), or p53 deletion (I) (bright- 
field and fluorescence overlay). Scale bar, 250 jum. n, Quantification of tube 
length (**P < 0.005 versus 10% serum, +P < 0.005 and *P < 0.05 versus 
starved cells, n = 3). 0, Endothelial gene expression in cardiac fibroblasts 

(*P < 0.005 versus 10% serum, +P < 0.05 versus PBS, n = 8). p, ChIP with p53 
(*P < 0.05). All graphs show mean = s.e.m.; scale bar, 10 [im unless mentioned. 


had already formed tubes demonstrated significant disruption of formed 
tubes compared to PBS-treated controls (Extended Fig. 8h-n). These 
observations suggest that p53 is required for maintaining the endothe- 
lial phenotype of the fibroblast-derived endothelial-like cell. Notably, 
the fraction of fibroblast-derived endothelial cells was maintained at 
14 days after cardiac injury in vivo (Figs 1d and 2c) despite declining p53 
levels suggestive of other factors stabilizing the endothelial phenotype. 


po3 mediates MEndoT in injured heart 

To determine whether p53 mediates mesenchymal—endothelial trans- 
ition in vivo, we subjected Colla2-CreERT:R26R41°™*"°:p53"" mice 
to ischaemic cardiac injury 5 days after cessation of tamoxifen. Mice 
with intact p53 exhibited 6 + 0.64-fold increase (mean + s.e.m.) in cardiac 
fibroblast p53 levels 3 days after cardiac injury, but in mice with fibroblast- 
specific p53 deletion (referred to as Colla2-CreERT:R26R 1"? p53 6° 
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(conditional knock out)) the mean p53 expression in border zone car- 
diac fibroblasts failed to increase significantly (Fig. 3a—c). The degree of 
mesenchymal-endothelial transition at 3 days post-injury in Colla2- 
CreERT:R26R‘410™"°:p53C%° mice decreased by 57% (Fig. 3d-f) and 
was associated with a decrease in capillary density in the injury region 
(Fig. 3g). Increase in vessel density in the injured heart is a critical post- 
ischaemic repair mechanism. Diminished neovascularization can lead 
to rapid decline in cardiac function after cardiac injury and is associated 
with worsened scarring**. Echocardiography on hearts of mice 7 days 
after cardiac injury showed significant worsening of cardiac function in 
p53*° mice (Fig. 3h, i). Masson’s trichrome staining demonstrated a 
greater amount of collagen deposition in the hearts of Colla2-CreERT: 

R26RTO™*"H53°K° mice (Fig. 3j, k). The degree of inflammatory 
infiltrate measured by the number of CD68-expressing macrophages 


was also significantly higher in the Colla2-CreERT:R26R‘10™"°:p53-*° 
mice (Extended Data Fig. 9a, b, d). These observations demonstrate that 
p53 is necessary for mesenchymal-endothelial transition to occur after 
ischaemic cardiac injury and that disruption of mesenchymal-endothelial 
transition is associated with diminished post-injury vascularity and car- 
diac function. 

We next looked at mechanisms inducing p53 expression in cardiac 
fibroblasts. Carboxy-terminal phosphorylation of histone H2A family 
member X (yYH2AX) isa DNA damage response pathway upstream of 
p53, is activated by hypoxic endothelial cells and shown to have a critical 
role in endothelial cell proliferation and hypoxic neovascularization®. We 
observed that 90% of tdTomato cells that upregulated p53 co-expressed 
yH2AX, suggesting that the YH2AX pathway is strongly associated with 
p53 expression in fibroblasts undergoing mesenchymal-endothelial 
transition (Extended Data Fig. 10a-c). 
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Figure 3 | Mesenchymal-endothelial transition after cardiac injury is 

p53 dependent. a-c, p53 immunostaining in Colla2-CreERT:R26R‘7°"*"° 
and Colla2-CreERT: RI6RUTOMAO, »p530° hearts (a, b; arrowheads) and 
quantification of p53 expression in labelled fibroblasts (c; *P < 0.005). 

d-f, VECAD immunostaining in labelled fibroblasts (d, e; arrowheads) and 
labelled fibroblasts (%) expressing VECAD (f; *P < 0.005). g, Number of 
endothelial cells per high-power field (HPF) (*P < 0.005, n = 5 mice). 

h, i, Cardiac function before and 7 days after injury (*P < 0.05, n = 9 mice). 
j, Masson’s trichrome staining 14 days after injury. k, Quantification of fibrosis 
(*P < 0.05) (LV, left ventricle, n = 4 mice unless mentioned). All graphs show 
mean = s.e.m.; scale bar, 10 um. 
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po3 activation enhances MEndoT after cardiac injury 


We investigated whether stimulation of p53 signalling after cardiac 
injury enhances mesenchymal-endothelial transition after acute isch- 
aemic cardiac injury. Colla2-CreERT:R26R‘“1°™*° mice injected with 
RITA daily for 3 days after cardiac injury exhibited significantly increased 
p53 expression in labelled fibroblasts (13 + 0.8-fold in RITA-injected 
animals versus 6-fold increase in PBS-injected animals) (Fig. 4a—c). RITA 
significantly enhanced the degree of mesenchymal-endothelial trans- 
ition (Fig. 4d—f) and enhanced mesenchymal-endothelial transition was 
associated with a 26% increase in endothelial cells in the injury border 
zone (Fig. 4g). Increased neovascularization can lead to decreased col- 
lagen deposition and rapid improvement in post-injury cardiac function’. 
The area of collagen deposition 7 days after cardiac injury decreased by 
57% in RITA-injected animals (Fig. 4h, i). Echocardiography demon- 
strated significantly improved cardiac function in RITA-injected mice 
compared to PBS-injected control mice (Fig. 4j, k). 

Consistent with greater vascularity and better cardiac function, the 
inflammatory infiltrate was substantially reduced in RITA-injected 
animals (Extended Fig. 9a, c, d). As RITA increases p53 levels, we also 
looked at deleterious effects of increased p53 levels, such as apoptosis. 
We subjected Colla2-CreERT:R26R"41°™" mice to cardiac injury and 
consistent with published reports, did not observe increased p53 expres- 
sion in myocytes after injury (Extended Data Fig. 9e, f)*°. In RITA-treated 
mice 3 days after injury there was not a significant increase in p53- 
expressing TUNEL-stained nuclei compared to PBS-injected controls 
(Extended Data Fig. 9g). 

To demonstrate that increased mesenchymal—endothelial transition 
after administration of RITA was p53-dependent in cardiac fibroblasts, 
we injected RITA into Colla2-CreERT:R26R41°™*"°-p53*° mice and 
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Figure 4 | RITA enhances mesenchymal-endothelial transition after 
cardiac injury. Hearts from PBS-treated (white bars) or RITA-treated 
(blue bars) Colla2-CreERT:R26R™7°™* mice. a-c, p53 immunostaining in 
labelled fibroblasts (a, b; arrowheads) and quantification of p53 expression 
(c; *P < 0.005). d, e, VECAD immunostaining in labelled fibroblasts 
(arrowheads). f, Labelled fibroblasts (%) expressing VECAD (*P < 0.05). 
g, Endothelial cells per HPF (*P < 0.05). h, Masson’s trichrome staining 
(blue, arrowheads) near apex (top row), between mid-ventricle and apex 
(middle row) and mid-ventricle (bottom row) (n = 14 mice). i, Quantification 
of fibrosis (*P < 0.001, n = 14 mice). j, k, Cardiac function before and 7 days 
after injury (*P < 0.05, n = 8 mice). n = 4 mice unless mentioned; all graphs 
show mean = s.e.m.; scale bar, 10 um. 
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observed that RITA failed to significantly enhance mesenchymal- 
endothelial transition. The degree of mesenchymal-—endothelial transition 
was not statistically different from that observed in PBS-treated Colla2- 
CreERT:R26R 1100953 KO mice (Extended Data Fig. 9h, i). These 
observations suggest that RITA predominantly enhances mesenchymal- 
endothelial transition by activating p53 in cardiac fibroblasts. 


Discussion 


Our report suggests that cardiac fibroblasts possess a degree of native 
cellular plasticity that enables them to adopt endothelial-cell-like fates 
after cardiac injury. Recruitment and proliferation of endothelial cells 
in the ischaemic heart is a critical cardiac repair response. Our data 
point to the fibroblast as a novel and robust source of endothelial cell 
generation in the injured heart. Mesenchymal—endothelial transition 
(or MEndoT), a hitherto unreported biological phenomenon, appears 
to have an important physiological role in cardiac repair, as disruption 
of mesenchymal-endothelial transition worsened post-infarct vascularity 
and cardiac function. Teleologically, mesenchymal—endothelial transition 
provides the heart with an efficient strategy to increase neovascular- 
ization rapidly in the injured region. The use of a small molecule to 
augment mesenchymal-endothelial transition and increase neovascu- 
larization after injury suggests that mesenchymal-endothelial trans- 
ition may represent a therapeutic target for enhancing vascularity and 
repair of ischaemic tissues. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Animal care and use. All animal studies were approved by the Institutional 
Animal Care and Use Committee at the University of North Carolina, Chapel 
Hill and the University of California, Los Angeles. 

Generation of transgenic and conditional knockout mice. Collagenla2-CreERT: 
R26R'“T°™*"° mouse lines were obtained by crossing collagenla2-CreERT mice with 
lineage reporter R26R'“7°™"° mice. To obtain collagenla2-CreERT:R26R1°"*”°: 
p53°*° mice, collagenla2-CreERT:R26R'“"°™"*" mice were crossed to p53" mice 
and backcrossed to generate collagen1a2-CreERT:R26R{1°"*"°:p53"/" mice. Colla2- 
CreERT:R26R“"7°™"° mice were crossed with Coll-GEP mice to create progeny 
Colla2-CreERT:R26R'41°™°:Coll-GFP mice. FSP1-Cre mice were crossed with 
the R26R'“7°"*"° mice to create FSP1-Cre:R26R“""°™""° mice. Tamoxifen (1 mg) 
(Sigma) was injected intraperitoneally for 10 days to induce Cre-mediated recom- 
bination in Colla2-CreERT mice. Five days following cessation of tamoxifen 
animals were subjected to ischaemic cardiac injury. All mice were on a C57BL/6 
background. 

Murine cardiac injury model. Mice (both male and female), 8-10 weeks old, were 
randomly allocated to sham or ischaemia-reperfusion cardiac injury. Investigators 
performing surgeries and cardiac function studies were blinded to mouse genotype 
or treatment. Mice were initially anaesthetized with 3% isoflurane, maintained at 
2% isoflurane, and intubated using a Harvard Rodent Volume-Cycled ventilator. 
Myocardial injury was induced by 30 min ligation of the left anterior descending 
(LAD) coronary artery followed by reperfusion. Sham injury was performed in the 
same manner, with a ligature passed under the LAD, but the LAD was not ligated. 
Group size was estimated based on previously observed mortality rates following 
surgery. No animals were excluded from analysis unless they died during the surgical 
procedure. At 1, 2, 3, 7 or 14 days after injury, the mice were anaesthetized with 
pentobarbital (45 mg kg’) and the left ventricle was perfused with 5 ml PBS fol- 
lowed by 2 ml 4% PFA. For RITA-treated mice, beginning 24h after injury RITA 
(Millipore) was administered intraperitoneally at 0.3 mg kg’ once daily for 3 days 
or until harvest. For acLDL staining, 1 1g fluorescent labelled acLDL (Invitrogen) 
per gram body weight was injected via a catheter inserted in the jugular vein and 
hearts harvested as described 4h later. For DiO staining, DiO was prepared as 
described"* and 2 ml of 120 ppg ml! DiO perfused before PFA perfusion’®. 

After perfusion, the heart was further fixed for 1 h in 4% PFA at 4°C and after 

1h the PFA was replaced with fresh 4% PFA and incubated at 4 °C for another 3 h. 
At the end of 4h, the hearts were sucrose embedded overnight in a 30% sucrose 
solution (MP Biochemicals) and then frozen in OCT (Tissue-Tek). 7 tum sections 
were prepared in a longitudinal or transverse orientation. 
Echocardiography. Echocardiography was performed in conscious mice. The hair 
over the anterior chest was removed by a depilation cream and warmed Aquasonic 
gel applied over the thorax. Mice were held firmly by hand for the duration of the 
procedure (approximately 5-10 min) and conditioned daily for the procedure 
starting 3 days before the procedure. The probe was positioned over the chest in a 
parasternal position. Parasternal long axis B-mode and M-mode images are recorded. 
Measurements and analysis were then performed as described’®. The echocardio- 
grapher was blinded to the genotype and treatment of the animal being examined. 
Immunohistochemistry and histology, confocal imaging and quantitation, 
super-resolution microscopy. Immunofluorescent staining on frozen sections 
(7 um) was performed using primary antibodies to VECAD (catalogue number 
ab33168, Abcam), eNOS (ab66127, Abcam), claudin 5 (ab53765, Abcam), occludin 
(ab31721, Abcam), YH2AX (ab2893, Abcam), p53 (ab31333 and ab26, Abcam), 
Coll (ab6308, Abcam), podoplanin (ab11936, Abcam), o-SMA (ab5694, Abcam), 
CD68 (ab125212, Abcam), CD146 (ab75769, Abcam), Cre (BIOT-106L & PRB-106P, 
Covance), troponin (SC-8121, Santa Cruz), NG2 (AB5320, EMD Millipore) and 
associated APC or fluorescein conjugated secondary antibodies (Abcam, Millipore, 
Invitrogen) as per manufacturer instructions. Labelled sections were imaged using 
a Leica SP2 AOBS Upright Laser Scanning Confocal Microscope (Leica Microsystems). 
Five independent images for each area (that is, injury border zone, remote from 
injury, sham) were obtained and used for quantitative analysis. Co-localization ana- 
lysis of confocal images was performed using Image J software (NIH). Mesenchymal- 
endothelial transition percentages were derived by counting the number of dually 
labelled cells and dividing by the number of tdTomato-positive cells. Quantification 
of vascular area derived from fibroblasts was performed using the JACoP Image J 
plugin’’. p53 expression levels in tdTomato-positive cells were determined using 
the JACoP Image J plugin and expressed as a Manders coefficient normalized to 
p53 levels in sham injury”’. In each case, five independent images from each area 
were analysed from sections prepared from each mouse. For super-resolution micro- 
scopy, a custom-made Stimulation Emission Depletion Super-resolution micro- 
scope was used. This microscope has a confocal channel for tdTomato and an 
ATTO0647 flurophore super-resolution channel that was used for VECAD, NG2 or 
CD146 visualization. Masson’s trichrome staining was performed on heart sec- 
tions as described””. 
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Fibroblast isolation and culture. Cardiac fibroblasts were isolated from the 
explanted hearts of euthanized uninjured mice. The hearts were explanted and 
washed three times with 1X HBSS (Gibco). The heart was minced into approxi- 
mately 1-mm/? sized pieces and digested using 10 ml of a 0.1% trypsin solution 
(Gibco) with 50 U ml! collagenase II (Worthington). Five sequential digestions 
were performed at 37 °C, the cells collected and passed through a 40 iM strainer 
and plated in IMDM, 1  penicillin/streptomycin, 10% FBS for 1 h at 37 °C. After 
1h the medium was changed to F12K 1X penicillin/streptomycin, 10% FBS (Gibco), 
10ng ml! leukaemia inhibitory factor (LIF) (Millipore) and 10 ng ml! basic 
fibroblast growth factor (bFGF) (Millipore). Cells were maintained under these 
conditions until they became confluent in 7-10 days. 

Matrigel tube formation and LDL uptake assay. 6 X 10* fibroblasts per cm? 
were cultured overnight at 37 °C, 5% CO, on Growth Factor Reduced Matrigel 
Basement Membrane Matrix (BD Biosciences) coated wells of Nunc Lab-Tek II 
CC2 chamber slides (Thermo). Serum starved or unstarved cells were cultured in 
IMDM, 1x penicillin/streptomycin or IMDM, 1  penicillin/streptomycin, 10% 
fetal bovine serum (Gibco), respectively. Pifithrin-« 100 uM (P4359, Sigma) or 
RITA 0.1 1M (506149, EMD Chemicals) were added to the cells cultured with the 
above culture medium. Acetylated LDL (Invitrogen) uptake was performed as 
described”. 

Flow cytometry. Flow cytometric analysis for cell surface markers were done 
using antibodies CD31-APC (17-0311, eBioScience), CD34-FITC (11-0341, eBio- 
science), CD133-FITC (11-1331, eBioscience), FIk1-APC (17-5821, eBioscience), 
VECAD-APC (17-1441, eBioscience), CD45-APC (103111, Biolegend), c-Kit APC 
(561074,BD Bioscience), DDR2 (SC-7555, Santa Cruz) and vimentin (Ab1620, 
Millipore). For vimentin and DDR2, a secondary APC conjugated anti-goat anti- 
body was used (SC3860, Santa Cruz). Cultured cardiac fibroblasts were dissociated 
using accutase (Innovative Cell Technologies) and immunostained in FACS buffer 
(0.1% BSA PBS) at 1 X 10° per ml for 20-30 min at 4 °C, followed by washing twice 
with FACS buffer and subsequently analysed in Beckman-Coulter (Dako) CyAn 
ADP. Data obtained was analysed and represented using Flowjo software. 
Quantitative real-time PCR. RNA was isolated from cardiac fibroblasts cultured 
in the presence or absence of serum (IMDM, 1% penicillin/streptomycin, with 
without 10% FBS), or in medium without serum and containing either 100 1M 
pifithrin-c or 0.1 uM RITA, grown for 48 h at 37 °C, 5% CO. RNA isolation and 
reverse transcription was performed using the SV Total RNA Isolation Kit and the 
Reverse Transcription System (Promega). qPCR was performed using the SensiMix 
SYBR and Fluorescein Kit (Quantace) on an iQ5 thermal cycler (BioRad). 
Western blot. Protein was harvested from cardiac fibroblasts cultured in the 
presence or absence of serum (IMDM, 1X penicillin/streptomycin, with or with- 
out 10% FBS) for 24, 48 and 72h at 37 °C, 5% CO3. 

Concentration normalized protein was prepared with SDS loading buffer and 

run on a 12X Mini-Protean TGX gel (BioRad) at 300V for 25 min. The protein 
was transferred from the gel to a nitrocellulose membrane using the Trans-Blot 
Turbo System (BioRad). After blocking with TBST +3% cold fish gelatin, the 
membrane was probed using primary antibodies to p53 (ab31333, Abcam) and 
a-tubulin (T6199, Sigma), washed with TBST, and labelled using fluorescently con- 
jugated secondary antibodies (LI-COR Biosciences). After washing with TBST, the 
membrane was visualized on an Odyssey scanner (LI-COR Biosciences). Densi- 
tometry analysis was performed using the Gel Analyzer unit of Image] software 
(NIH). 
Chromatin immunoprecipitation (ChIP). Cardiac fibroblasts (30 X 10°) were 
fixed in 1% formaldehyde, lysed in lysis buffer (50 mM Tris-HCl pH 8, 10 mM 
EDTA, 1% SDS, protease inhibitor cocktail Set I CALBIOCHEM) and sonicated 
using a EpiShear Multi-Sample Sonicator (Active Motif), leading to fragments 
between 300 and 1,000 bp. ChIP was performed using a commercially available 
ChIP-IT High Sensitivity Kit (Active Motif) according to manufacturer’s instruc- 
tions. DNA-bound protein was immunoprecipitated using an anti-p53 antibody 
(ab31333, Abcam) and anti IgG (SC-2025, Santa Cruz) as a negative control. The 
DNA recovered was analysed by quantitative real time-PCR using different pri- 
mers sets that amplified the promoter region of Hoxa9, Hoxd3, CLDN5, Nos3 and 
negative control genes: Hoxa9 (1) 5'-TAAACTGCTCAGGCCATGCT-3’ sense, 
5'-CAGCCTGGACCCACTGAAAG-3’ antisense; Hoxd3 (1) 5'-TGCCTCAGT 
GTATTTCTCCCC-3’ sense, 5’-ATGGATAACAGTGCCCGGTG-3’ antisense; 
claudin 5 5'-CTCCGGAAGCCAACTTGGAG-3’ sense, 5'-GGACCCAGTGTG 
TCTAACCC-3’ antisense; eNOS 5’-GGGACAGCCAGAAAATGGGA-3’ sense, 
5'-ACTGCTGTCGGTCTCTTTGT-3' antisense. 

Primers for p53 binding were designed using the SABiosciences’ proprietary 
database (DECODE, DECipherment Of DNA Elements), available from http:// 
www.sabiosciences.com/chipqpcr.php. 

PCR was performed with equal amounts of specific antibody immunoprecipi- 
tated sample, control (IgG) and input. Values were normalized to input measure- 
ments and enrichment was calculated using the comparative A-ACt (AACt) 
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method. Data shown correspond to one representative assay (that is, three PCRs) 
from a total of three independent assays each run with different sets of treated 
cells. 

Statistical analysis. Statistical analysis was performed using GraphPad software 
(Prizm) using Student’s t-test (two-tailed), one-way or two-way ANOVA with 
Bonferroni post-test analysis as appropriate. Welch’s correction was used if var- 
iances between groups were significantly different. A P value of <0.05 was con- 
sidered statistically significant. Graphs present the mean value + standard error 
of the mean (s.e.m.). 
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Extended Data Figure 1 | Phenotypic characterization of tdTomato- 
labelled cells in hearts of Colla2-CreERT:R26R7°™° mice. a—c, Hearts 
were digested to obtain a non-myocyte population and subjected to flow 
cytometry to determine expression of DDR2 (b) and vimentin (c). Panel 

a serves as a control with no primary antibody added (number in each quadrant 
represents fraction of total population of cells). d, e, Immunofluorescent 
staining on sham-injured hearts of Colla2-CreERT:R26R'"'°™"? mice to 
determine expression of DDR2 (d) (green) and vimentin (e) (green) in 
tdTomato-labelled cells (red) with merged image (right; arrowheads point to 
tdTomato-labelled cells staining for DDR2 or vimentin). Scale bar, 10 pm. 
f-h, Non-myocyte cells from the heart were subjected to flow cytometry to 
determine expression of VECAD (f), CD31 (g) or c-Kit (h) (number 
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in each quadrant represents fraction of total population of cells). 

i-k, Immunofluorescent staining for 7-smooth muscle actin (i), CD68 (j) or 
podoplanin (k) was performed on frozen heart sections prepared from 8-week- 
old sham-injured Colla2-CreERT:R26R""1°™° mice after tamoxifen 
injections and co-localization analysis performed to determine the number of 
labelled cardiac fibroblasts expressing «-smooth muscle actin (99.4% negative 
for a-SMA, 1,500 cells examined), CD68 (100% negative for CD68, 1,000 cells 
examined) or podoplanin (arrowheads). Scale bar: i-k, 10 um. 1, m, STED 
super-resolution microscopy demonstrating tdTomato-labelled cells (arrows) 
not expressing NG2 (1) (green, arrows, STED channel) or CD146 (m) (green, 
arrows, STED channel). Scale bar, 10 um. 
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Extended Data Figure 2 | Expression of VECAD in fibroblast-derived Colla2-CreERT:R26R'°"*° mice underwent ischaemia reperfusion injury 

endothelial cells that take up DiO and expression of a-smooth muscle actin _ and hearts were harvested 3 days after injury and immunofluorescent staining 
and VECAD in tdTomato-labelled cells. a, Colla2-CreERT:R26R*70™° performed for o&-smooth muscle actin and VECAD. Section shows tdTomato- 
mice were subjected to ischaemic injury, injected with DiO 7 days after cardiac _ labelled cardiac fibroblasts (b), VECAD (c), «-smooth muscle actin expressing 


injury and then harvested. High magnification of a wall of a blood vessel in cells (d) and merged image (e) showing co-localization of fluorophores. 

the injured region demonstrates luminal cells staining for DiO (green, Arrowheads show labelled cardiac fibroblasts expressing «.-smooth muscle 
arrowheads), VECAD (blue, arrowheads), tdTomato (red, arrowheads) and actin and arrow shows a labelled cardiac fibroblast expressing VECAD but not 
merged image demonstrating co-localization of all three fluorophores (filled smooth muscle actin. Scale bar: b-e, 10 im. f, Fraction of labelled cardiac 
arrowheads); open arrowhead points to VECAD™ DiO* cell that does not bear _ fibroblasts that are o-smooth muscle actin’ and VECAD* or VECAD™ 

the tdTomato label. Scale bar, 10 um. b-f, Expression of o-smooth muscle (mean + s.e.m., *P < 0.05, n = 3). 


actin and VECAD in tdTomato-labelled cells. b-e, Tamoxifen-injected 
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Extended Data Figure 3 | tdTomato expression in hearts of vehicle-injected 
paneemcran ee omato mice and immunostaining for Coll and 
VECAD in tamoxifen-injected Colla2-CreERT:R26R'*7°™"° mice after 
cardiac injury. a-c, Oil-injected Colla2-CreERT:R26R"1°™° mice 
underwent ischaemia reperfusion injury and 3 days after injury, hearts were 
harvested and sectioned. Area of injury (white lines) was stained with 
Alexa488-labelled wheat germ agglutinin (a) (WGA, stains cell membranes), 
rare tdTomato-expressing cells in same field (b) (arrowhead) and merged 
image (c) showing the presence of rare labelled cells in the injury region 
(arrowhead) (28 labelled cells out of 38,000 cells counted (0.07%), n = 3 mice). 
Scale bar: 100 um. d, e, Colla2-Cre:R26R™7°™" mice (injected with tamoxifen 


as described in text) were subjected to ischaemic cardiac injury and hearts 
harvested 3 days post injury and stained for Coll and VECAD. d, Region of 
injury demonstrating a VECAD-expressing cell (blue, arrow) staining positive 
for Coll (green) but negative for the tdTomato label (merged arrow). 
Arrowheads in merged panel show tdTomato-labelled cells expressing VECAD 
but not Coll. Scale bar, 10 um. e, tdTomato-labelled cells expressing VECAD 
(arrowheads) that do not stain for Coll (green) with merged image showing 
tdTomato* VECAD* cells not staining with the Coll antibody. Out of 225 cells 
counted (n = 3 mice), we did not observe a single tdTomato*’ VECAD* cell to 
stain for Coll. Conversely, not a single VECAD* Coll” cell exhibited 
tdTomato fluorescence. Scale bar, 10 um. 
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Extended Data Figure 4 | VECAD and Cre immunostaining in heart 
sections of Colla2-Cre:R26R*7°™*°:Col1-GFP mice 3 days after injury. 
a, b, Colla2-Cre:R26R'T™*"°:Col1-GEP mice were subjected to ischaemic 
cardiac injury and hearts harvested at 3 days post injury and stained for 
VECAD. a, tdTomato-labelled cells (red, arrowheads) expressing VECAD 
(blue, arrowheads) but not GFP (green), with merged image demonstrating 
co-localization of tdTomato and VECAD but not GFP (arrowheads). 

Scale bar, 10 um. b, High magnification of a blood vessel (asterisk) outlined 
by VECAD staining (blue) demonstrating rare cell that co-localizes all three 
fluorophores (tdTomato ‘VECAD* GFP", white, arrowhead). Arrows point 
to tdTomato-positive cells expressing VECAD but not GFP. Scale bar, 

10 pm. c-f, Immunostaining for Cre protein on hearts of Colla2- 
Cre:R26R‘“T°™"°:Coll-GEP mice harvested 3 days following ischaemic injury 
to detect Cre expression in the nucleus of tdTomato-labelled cells expressing 


QO 


Vv 


FP /GFP 


DAPI/ 


VECAD. tdTomato-labelled cells (c) in area of injury (arrowheads) expressing 
VECAD (d) (arrowheads), merged image showing co-localization of 
fluorophores (e) (arrowheads) and nuclei stained for DAPI and Cre 

(f) demonstrating absence of any detectable nuclear Cre protein. Scale bar, 

10 um. g-i, Positive control demonstrating section of heart of Wt-1Cre 
transgenic mouse heart 3 days after injury with region of injury stained for 
DAPI (g), Cre (h) (red) and merged image (i) demonstrating numerous cells in 
the injury region expressing nuclear Cre (arrowheads). Scale bar, 10 um. 

j-m, Cre immunostaining in tdTomato-labelled cells expressing GFP. Area in 
region of injury with nuclei stained for DAPI (j), tdTomato expression 

(k) (arrowhead), GFP expression (arrowhead) (1) and merged image (DAPI, 
Cre and GFP) (m) showing Cre staining (red channel) localized to the 
cytoplasm of GFP-expressing cell (arrowhead). Scale bar, 10 um. 
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Extended Data Figure 5 | Flow cytometry for endothelial progenitor 
markers on non-myocyte cells harvested from uninjured hearts of Colla2- 
CreERT:R26R‘7°™"° mice and bone marrow cells isolated from the same 
animal. a, b, Hearts of mice were digested, myocytes discarded and the entire 
non-myocyte population without any further selection was subjected to flow 
cytometry. Expression of tdTomato and CD45 and Flk-1 (APC fluorophore) 
(a) and CD133, CD34 and combined expression of CD34 and Flk-1 in 


—- — 
10? 10° 10* 10° 
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non-myocyte population (b). c-f, Bone marrow cells were isolated from 
Colla2-CreERT:R26R'"°™"° mice and without further culture subjected to 
flow cytometry. Expression of tdTomato and CD45 (c), Flk-1 (d), CD133 (e), 
and CD34 and combined expression of Flk1 and CD34 (f) in bone marrow cells. 
g, Expression of tdTomato in bone-marrow-derived mesenchymal stem-cell 
colonies (black arrowhead points to rare tdTomato-positive cell; white 
arrowheads point to mesenchymal stem cells). Scale bar, 50 um. 
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Extended Data Figure 6 | Mesenchymal-endothelial transition in FSP1- Injury region demonstrated tdTomato-labelled cells expressing VECAD or 
Cre:R26R%7™*° mice. a, b, FSP1-Cre:R26R'47°™"° mice were subjected to _isolectin (arrowheads, n = 4). Scale bar, 10 um. ¢, Quantification of labelled 
sham (a) or ischaemia-reperfusion (b) cardiac injury. Hearts were harvested fibroblasts that express VECAD or isolectin in sham-injured animals and in the 
3 days after injury and stained for endothelial marker VECAD or isolectin. injury border zone (mean = s.e.m., *P < 0.01, n = 4). 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


a b 
5 
= * oe 
08 
OR 4 * 
> oo 
2 2 2 22 
ic £ = £Oo 
x © a Qe 
€ s ~ a 22 
3 E g E ae? 
® 2 2 2 oe 
n oO (0) o Ag 
xe no (2) ao c =| 
roy x xs x o 2 
= °o Fas] ° De 9 
52 
BG 
Oc 
Lo 4 
-tubulin a 
ae Ge «ass am ° > 
0 
S < a @ 
& oy we av 
< s 7 s 
\o% f & 
$ os s° 
c d 
Pifithrin RITA 200 
€ 150 
= 
= 
D 
Cc 
2 100 
3 
2 
50 
10% Serum 10% Serum 
eS) 
\o 
S 
o\e 
RS) 


Extended Data Figure 7 | Effect of serum starvation on p53 levels and effects quantification of western blot (mean + s.e.m., *P < 0.05 compared to cells in 
of pifithrin-a and RITA on tube formation of serum-fed cardiac fibroblasts. 10% serum). c, Effect on tube formation after adding pifithrin-« or RITA to 
a, Western blot for p53 in cardiac fibroblasts subjected to serum starvation for _ cardiac fibroblasts grown in 10% serum. Scale bar, 250 um. d, Quantification of 
24, 48 and 72h (representative sample from n = 3). b, Densitometric tube formation (mean + s.e.m., *P < 0.05, n = 3). 
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Extended Data Figure 8 | Effect of adding TGF-f to serum-starved cardiac 
fibroblasts, or adding TGF-f, serum or pifithrin-a on tubes that have 
already formed. a, b, Tube formation of cardiac fibroblasts subjected to serum 
starvation in the absence (a) or presence (b) of TGF-B. (TGF-B was added at the 
onset of serum starvation.) Scale bar, 250 jum. c-f, Effect of adding TGF-B or 
serum to tubes that had already formed. c, e, Twenty-four hours after serum 
starvation (after tubes had already formed), PBS was added to tubes shown in 
cand e and photographs taken after another 24h. d, f, After tubes had already 
formed (24h of serum starvation), TGF-B (d) or serum (f) was added and 
photographs taken after another 24h (note clumping of cells and regression of 
tubes in d and f). Scale bar: c-f, 250 jum. g, Effect of adding TGF-B or serum to 
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tubes that had already formed, expressed as a percentage decrease in tube 
length. h-n, Effect of adding pifithrin-« to serum-starved cardiac fibroblasts 
that had already formed tubes. h, k, Tube formation in cardiac fibroblasts 
serum starved for 24h. Scale bar, 250 jum. i, j, PBS was then added to cardiac 
fibroblasts shown in h and photographs were taken after another 24 h (i) or 48h 
(j) of serum starvation. Scale bar, 250 pm. |, m, Pifithrin-o was added to cardiac 
fibroblasts shown in k and photographs were taken after another 24 h (1) or 
48 h (m) of serum starvation in the presence of pifithrin-«. Scale bar, 250 jum. 
n, Tube length in j and m was expressed as a per cent change from their 
respective control (h and k) (mean + s.e.m., *P < 0.05 compared to control, 
n= 3). 
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Extended Data Figure 9 | RITA decreases inflammatory infiltrate after 
cardiac injury, does not increase apoptosis in myocytes and does not 
enhance mesenchymal-endothelial transition in Colla2-CreERT: 
R26R'ATOMAHE 5 3CKO mice. a-d, Sections of hearts harvested at 3 days after 
cardiac injury were stained for the monocyte/macrophage marker CD68 
(green, arrowheads) in Colla2-CreERT:R26R@70ma9 (a), Colla2-CreERT: 
R26RA7T™*.538° (b) and RITA-injected Colla2-CreERT:R26R'1°"*° 
(c) mice. Scale bar, 10 um. d, Quantification of the number of CD68 cells per 
high-power field in the injury region (mean + s.e.m., *P < 0.05 versus Colla2- 
CreERT:R26R'41°™"°, **P < 0.05 versus Colla2-CreERT:R26R'1°™", 

n = 3). e-g, p53 expression in myocytes after cardiac injury and effect of 
RITA on apoptosis in injury region. e, f, Colla2-CreERT:R26R'"'°™"" mice 
were subjected to ischaemic cardiac injury, hearts harvested at 3 days and 
sections stained for p53 and the cardiomyocyte marker troponin. e, p53 
(green, arrowheads) staining is observed in tdTomato-expressing cells 

(red, arrowheads) but not in cardiomyocytes (blue); merged image shows 
arrowheads pointing to tdTomato-labelled cells expressing p53. Scale bar, 


10 um. f, Higher magnification in injury region demonstrating tdTomato cells 
(arrowheads) expressing p53 (merged, yellow, arrowheads); p53 staining 

is not visible in cardiomyocytes (blue). Scale bar, 10 um. g, TUNEL staining 
and quantification to determine p53~ apoptotic cells after RITA injection 
(arrowheads point to p53*TUNEL}* cells in Colla2-CreERT:R26R'70™"° 
mice (left panel) and RITA-injected Colla2-CreERT:R26R%70™*° mice (right 
panel); inset shows p53" TUNEL" cell in higher magnification (data shown as 
mean + s.e.m., ns, not significant, n = 3). Hearts in both cases were examined 
3 days after injury. Scale bar, 10 jum. h, i, Effect of RITA on mesenchymal- 
endothelial transition in Colla2-CreERT:R26R'47°™"°:p53° mice after 
cardiac injury. h, tdTomato-labelled cardiac fibroblasts expressing VECAD in 
Colla2-CreERT:R26R'"'°™"° mice treated with RITA or vehicle and Colla2- 
CreERT:R26R70™*"0.953CK0 mice treated with RITA or vehicle. Scale bar, 
10 pm. i, Quantification of the percentage of labelled fibroblasts undergoing 
mesenchymal-endothelial transition for each treatment group (mean = s.e.m., 
*P < 0.05, n.s., not significant, n = 4 mice per group). 
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Extended Data Figure 10 | yH2AX expression in cells expressing p53 after Scale bar, 10 pm. b, Immunostaining for p53 (green), YH2AX (blue) and 
ischaemic cardiac injury. Colla2-CreERT:R26R""°™"° mice were subjected — tdTomato (red) to determine co-expression of p53 and yH2AX in tdTomato- 
to ischaemic cardiac injury and immunostaining performed for YH2AX and _labelled cells. Arrowheads point to tdTomato-positive cells co-expressing 
p53. a, Immunostaining for p53 (green), YH2AX (red) and DAPI (blue) in yH2AX and p53. Scale bar, 10 jim. c, Quantification of the fraction of 

region of injury (arrowheads point to nuclei co-expressing YH2AX and p53). tdTomato* p53” cells expressing YH2AX (mean = s.e.m., *P<0.01, n = 3). 
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rystal structure of the PRC1 ubiquitylation 


module bound to the nucleosome 


Robert K. McGinty', Ryan C. Henrici>? & Song Tan! 


The Polycomb group of epigenetic enzymes represses expression of developmentally regulated genes in many eukaryotes. 
This group includes the Polycomb repressive complex 1 (PRC1), which ubiquitylates nucleosomal histone H2A Lys 119 
using its E3 ubiquitin ligase subunits, Ring1B and Bmil, together with an E2 ubiquitin-conjugating enzyme, UbcH5c. 
However, the molecular mechanism of nucleosome substrate recognition by PRC] or other chromatin enzymes is unclear. 
Here we present the crystal structure of the human Ring1B-Bmil-UbcH5c E3-E2 complex (the PRC1 ubiquitylation module) 
bound to its nucleosome core particle substrate. The structure shows how a chromatin enzyme achieves substrate specificity 
by interacting with several nucleosome surfaces spatially distinct from the site of catalysis. Our structure further reveals 
an unexpected role for the ubiquitin E2 enzyme in substrate recognition, and provides insight into how the related histone 
H2A E3 ligase, BRCAI, interacts with and ubiquitylates the nucleosome. 


The Polycomb group (PcG) proteins are transcriptional repressors that 
correlate with diverse human cancers and constitute key epigenetic reg- 
ulators of stem-cell self-renewal and lineage development’*. The PcG 
proteins assemble into multi-component complexes that remodel chro- 
matin structure both directly and through the establishment and removal 
of histone post-translational modifications®’. One of these complexes, 
PRC1, monoubiquitylates histone H2A Lys 119 (refs 8, 9). Once thought 
of as a unique complex, it is now clear that human PRC1 exists as a 
modular subset of complexes with distinct functions and gene targets’”"’. 
All PRC1 complexes share a common core — the RING-type E3 ubi- 
quitin ligase composed of Ring1B (also known as RNF2) or Ring1lA, 
and Bmil (also known as PCGF4) or one of five other Polycomb group 
RING finger (PCGF) paralogues with potential for H2A ubiquitylation. 
This E3 core combines with accessory subunits to form canonical and 
variant PRC1s. Variant complexes, especially those containing histone 
demethylase KDM2B and/or RYBP, seem to be primarily responsible 
for H2A ubiquitylation'®’*""*. 

Protein ubiquitylation is accomplished through a cascade of E1, E2 
and E3 enzymes’». First, an El activates and transfers ubiquitin to an 
E2-conjugating enzyme, forming a thioester between ubiquitin and a 
cysteine in the E2 active site. Then, in an E3-dependent manner, the 
carboxy terminus of ubiquitin is attached to the e-NH, ofa target pro- 
tein lysine side chain through an isopeptide bond. RING family E3 
ligases, such as those found in PRC1, bridge an E2 and a target protein, 
presenting the target lysine to the E2 active site and constraining the 
geometry of the ubiquityl-E2 thioester, priming it for catalysis'‘* *. The 
E3 is considered the primary specificity determinant of ubiquitylation, 
reflecting the scarcity of E2s (~40 proteins in human) relative to E3s 
(more than 600 proteins)””. 

Although the cognate E2 for PRC1 in vivo is not established, the 
Ring1B-Bmil E3 ligase can pair with one of several E2s, including 
UbcH5c (also known as UBE2D3), to ubiquitylate nucleosomes in vitro”. 
The nucleosome core particle (NCP) is the fundamental unit of the 
eukaryotic genome formed by 145-147 base pairs (bp) of DNA wrapped 
around an octameric scaffold of histone proteins (two copies each of 
H2A, H2B, H3 and H4)?!. H2A—-H2B dimers and H2A alone are not 
substrates for Ring] B-Bmil, suggesting that surfaces formed only in the 


context of the nucleosome are required for the E3 ligase activity of PRC1 
(refs 22,23). A minimal RING heterodimer of Ring1B and Bmil exhibits 
comparable activity to the full-length complex, implicating the RING 
domains in both E2 and substrate binding”’. However, we lack a fun- 
damental mechanistic understanding of how the Ring1B-Bmil E3 ligase 
interacts with the nucleosome to direct H2A Lys 119 ubiquitylation, or 
more generally, for how chromatin enzymes recognize their nucleo- 
some substrate. 

We have crystallized the PRC1 ubiquitylation module, an E2-E3 
enzyme complex composed of UbcH5c and the minimal Ring1B-Bmil1 
RING heterodimer, bound to its NCP substrate, and solved the structure 
at 3.3 A resolution (Supplementary Table 1). To our knowledge, this is 
the first crystal structure of a histone-modifying-enzyme—nucleosome 
complex or ofa ubiquitin E2-E3-substrate complex. The structure shows 
that in contrast to most chromatin modification enzymes studied so 
far, the vast majority of interactions between the PRC1 ubiquitylation 
module and the nucleosome occur away from the catalytic site. Ring1B 
and Bmil bind to the histone surface of the NCP to orient UbcH5c for 
H2A ubiquitylation. Surprisingly, UbcH5c also makes direct contacts 
with the NCP that are crucial for nucleosome binding and ubiquityla- 
tion. This presents a novel role of ubiquitin E2 enzymes in substrate 
recognition. 


Overview of complex 


The structure shows one PRC1 ubiquitylation module bound to each 
disk face of the nucleosome (Fig. 1a, b). The two PRC1 modules more 
than double the height of the NCP and with the exception of the 
C-terminal region of UbcH5c, are entirely contained within a cylinder 
projecting up from the NCP disk surface. Each PRC1 ubiquitylation 
module forms a crescent with two halves of the crescent contributed 
by the E2 and E3, respectively. The Ring1 B-Bmil1 E3 heterodimer binds 
to the central histone surface of the NCP. The E2 UbcH5c completes 
the crescent, following the inner curvature of the DNA from its inter- 
face with Ring1B past the DNA end, ultimately terminating above the 
nucleosomal dyad. In this manner, the Ring1B-Bmil E3 ligase posi- 
tions the active site of UbcH5c over the C-terminal tail of H2A near the 
target residue Lys 119 (Fig. 1c). All three components of the PRC1 
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Figure 1 | Crystal structure of PRC1 ubiquitylation module-NCP complex. 
a, View of the complex looking down on the DNA superhelical axis. 

b, Orthogonal view of the complex with proximal and distal halves of the 
structure indicated. c, Zoomed view demonstrating Ring1 B-Bmi1 heterodimer 


ubiquitylation module contact histone and/or DNA components of the 
nucleosome burying 1,470 A? of solvent accessible surface. 

We selected UbcH5c for our structural studies owing to its high activity 
and thorough biochemical and structural characterization with Ring1B- 
Bmil. To overcome the low affinity and salt-sensitive E2-E3 interaction”, 
we genetically fused the C terminus of the Ring1B fragment to the amino 
terminus of UbcH5c, converting the three-polypeptide PRC1 ubiqui- 
tylation module into a stable two-polypeptide version. The two fused 
PRC] ubiquitylation modules in our structure align well with the previ- 
ously determined Ring] B-Bmil-UbcH5c structure” (root mean squared 
deviation (r.m.s.d.) of 1.0 A over all backbone atoms), suggesting only 
minor structural changes after NCP binding and confirming the genetic 
fusion did not markedly alter the structure of the ubiquitylation module 
(Extended Data Fig. la-c, e). Such validation is especially important 
given that the fused PRC1 ubiquitylation module is inactive in a ubi- 
quitylation assay owing to its inability to undergo El-mediated trans- 
thiolation, presumably because the fused E3 clashes with ubiquitin in 
the El adenylation site** (Extended Data Fig. 2a, b). The NCP is simi- 
larly largely unchanged compared to a published NCP structure using 
the same Widom 601 positioning sequence” (r.m.s.d. of 0.7 A over all 
protein backbone atoms) (Extended Data Fig. 1d, e). 

Although the two PRC1 ubiquitylation modules bind their respective 
nucleosome faces in a similar manner, there are differences particularly 
at the UbcH5c-nucleosome interfaces. Alignment of the two sides of 
the structure through the pseudo-two-fold symmetry of the nucleosome 
reveals near uniformity between the histones on the two sides of the 
structure (r.m.s.d. of 0.7 A) (Extended Data Fig. 1f-i). By contrast, the 
PRC1 ubiquitylation modules from the two faces show more variation 
(r.m.s.d. of 1.9 Aand 4.6 A for the E3 and E2 components, respectively), 
resulting from a flexing ofa hinge at the Ring1 B-Bmil-—nucleosome inter- 
face. Regions of the PRC1 ubiquitylation module close to this hinge, 
including the Ring1B-Bmi1l nucleosome-binding loops, are very sim- 
ilar on the opposing surfaces of the structure, whereas regions further 
from the hinge, notably the UbcH5c-nucleosome interfaces, exhibit larger 
differences. As a result, UbcH5c approaches the NCP more closely on 
one side (proximal side) of the structure than the other (distal side). 

Although we cannot rule out the possibility of plasticity in the UbcH5c- 
NCP interface, we suspect that the variation in the two halves of the 
structure result from different environments of each PRC1 module within 
the crystal lattice. This is not merely a result of the post-crystallization 
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positioning UbcH5c active site Cys 85 over the H2A C-terminal tail near the 
H2A Lys 119 Cx atom (orange sphere). Crystals contain the minimal RING 
heterodimer Ring1B(2-116)-Bmil(2-109). Ub, ubiquitylation. 


dehydration used to obtain 3.3 A resolution diffraction, as molecular 
replacement solutions from data collected on less dehydrated crystals 
with larger unit cells also exhibit similar differences (data not shown). 
Because the distal side places UbcH5c too far from the nucleosome sur- 
face to be consistent with our biochemical studies, we favour the prox- 
imal side, which is depicted in the following figures. (Details provided 
below; the distal side and alignments can be found in Extended Data 
Fig. 3.) Importantly, the differences between the two sides of the struc- 
ture influence the extent rather than the presence of interactions at each 
of the PRC1-NCP interfaces (Extended Data Fig. 1)). 


Ring1B-Bmil-histone interactions 

Several basic surfaces of Ring1B and Bmil have been shown to be required 
for nucleosomal ubiquitylation and binding to short duplex DNA”. This 
informed a structural model docking the Ring1B-Bmil heterodimer 
on DNA near the nucleosome dyad, positioning UbcH5c in proximity 
to H2A Lys 119 (ref. 22). However, recent work implicating the H2A- 
H2B acidic patch in Ring]B-Bmil-mediated H2A ubiquitylation in 
vitro and in vivo’ is inconsistent with this docking model. Our struc- 
ture reveals that the Ring] B-Bmil basic residues anticipated to bind 
DNA actually interact with histones surfaces, including the H2A-H2B 
acidic patch. The Ring] B-Bmil RING heterodimer, which contacts all 
four histones, forms a saddle over the N-terminal end of the H2B %C 
helix anchored on each side by Ring] B- and Bmil-histone interactions 
(Extended Data Fig. 3a—c). Ring1B binds to the acidic patch of the H2A- 
H2B dimer. This interaction, which buries nearly 700 A’ of solvent- 
accessible surface on the proximal side, is mediated by an intricate network 
of hydrogen bonds and van der Waals contacts and with few exceptions is 
limited to protein side chains (Fig. 2a and Extended Data Fig. 3d-g, t, u). 
Consistent with published mutagenesis experiments”, H2A Glu 92, 
Ring1B Lys 97 and Arg 98, and to a lesser extent Ring1B Lys 93 contri- 
bute to the Ring1B-nucleosome interface. Ring1B Arg 98 inserts into 
an acidic pocket generated by H2A residues Glu 61, Asp 90 and Glu 92, 
making charged hydrogen bonds with each of the H2A side-chain car- 
boxylates. The depth of this pocket is augmented by a ridge composed 
of H2B residues Glu 105 and His 109, which make van der Waals con- 
tacts with the aliphatic region of the Arg 98 side chain. This model of 
nucleosomal recognition mediated by an ‘arginine-anchor’ binding to 
the H2A-H2B acidic patch is present in all chromatin factor-NCP crystal 
structures published so far’”*° (Extended Data Fig. 4). Ring1B Lys 97 
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Figure 2 | Ring1B-Bmil E3 heterodimer interacts with histone surface. 
a, Ring1B-histone acidic patch interactions in cartoon representation 
with relevant side chains and hydrogen bonds (yellow) highlighted. 

b, Bmil-histone interface shown similarly. c, Coomassie blue stained 

gel of ubiquitylation assay using H2A mutant NCP as indicated. 

d, Quantified nucleosome ubiquitylation assay using indicated Ring1B 


projects into the acidic patch adjacent to Arg 98, hydrogen bonding to 
H2A residues Glu 61 and Glu 64. Not predicted by previous studies, 
Ring1B Arg 81 also binds to the acidic patch forming both side-chain 
and main-chain hydrogen bonds with H2A residues Asp 72 and Asn 89, 
respectively. This second arginine-binding site is distinct from sites 
observed in previous chromatin factor-NCP structures. Further con- 
tacts are mediated by Ring1B residues Lys 65, Lys 85, Lys 93 and Val95. 
With minor exceptions the Ring1B-NCP interface is identical on the 
proximal and distal faces. (Extended Data Fig. 3f, g) The considerable 
contribution of charged side chains to the Ring1 B-nucleosome inter- 
face is consistent with the loss of nucleosomal ubiquitylation by the 
PRC1 ubiquitylation module at higher ionic strength”. 

The structural details of the Ring1B-acidic patch interface are validated 
by our mutagenesis experiments in solution. Tandem alanine mutations 
of H2A Glu 61 and Glu 64 or Asp 90 and Glu 92 (or the combination 
thereof) eliminate all detectable nucleosomal ubiquitylation (Fig. 2c and 
Extended Data Fig. 5a). Nucleosomes containing the H2A Asn68Ala/ 
Asp72Ala double mutant at the secondary arginine-binding site exhibit 
roughly half wild-type activity levels. To assess the effects of muta- 
tions in the PRC1 ubiquitylation module quantitatively, we established 
fluorescence-based nucleosome ubiquitylation and binding assays” 
(Extended Data Fig. 6). The Ring1B Arg98Ala mutation results in a 
50-fold decrease in both nucleosomal ubiquitylation and the affinity of 
the fused PRC1 ubiquitylation module for the nucleosome (Fig. 2d, e). 
Alanine mutations of Ring1B Lys 97, Arg 81 and Lys 93 have similar, 
yet decreasing effects on nucleosome ubiquitylation and binding, whereas 
the Ring1B Lys59Ala mutation away from the binding site has little to 
no effect on PRC1 function. 

In contrast to the large Ring1 B—NCP interface, the Bmil-NCP inter- 
face is less extensive, burying only 330 A®. Bmil forms a cap on the 
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mutants. e, Nucleosome binding curves for fused wild-type and Ring1B 
mutant E2-E3 complexes. Data in d and e are mean and s.d. for each 

data point, n = 3. Fluorescence is normalized (norm.) to fit values for 
unbound and saturated NCP. Concentrations depicted using log scale. WT, 
wild type. 


C-terminal end of the «1 helix of H3 with a Polycomb-specific RING 
domain loop insertion (Fig. 2b and Extended Data Fig. 3h-k, v, w). This 
region was previously implicated in PRC1 function given the reduced 
activity of a Bmil(Lys62Ala/Arg64Ala) mutant”. These residues form 
charged hydrogen bonds with H4 Glu 74 and H3 Asp 77, respectively. 
The intervening Thr 63 side chain inserts into a small cavity created by 
the end of the «1 helix and the neighbouring L1 loop of H3, contributing 
substantial van der Waals contact surface and forming a hydrogen 
bond with the H3 Gln 76 main-chain carbonyl oxygen. An additional 
hydrogen bond is made between Bmil Glu 33 and H2B Lys 108. The 
distal side Bmil-NCP interface shares architectural similarity to its prox- 
imal counterpart with the exception of insufficient density to model the 
Lys 62 side chain (Extended Data Fig. 3), k). 

Mutagenesis of the Bmil-NCP interface allows us to assess the impor- 
tance of this surface relative to the Ring1B-NCP interface. The con- 
sequences of the Bmil(Arg64Ala) mutation are comparable to that of 
Ring1B(Lys93Ala), the least deleterious mutant tested at the Ring1B- 
NCP interface (Extended Data Fig. 7d-f and Supplementary Table 2). 
Meanwhile alanine mutations of His 61 and Lys 62 at the Bmil-NCP 
interface and Arg 45 removed from the interface exhibit near wild-type 
behaviour. A charge reversal mutation of the underlying histone sur- 
face (H3(Asp77Arg)—H4(Glu74Arg)) decreases ubiquitylation by ~80% 
(Extended Data Fig. 5a). This reduction in activity is smaller than that 
resulting from alanine mutations at the Ring] B-NCP interface. Para- 
doxically, a Bmil(Glu33Ala) mutation both markedly decreases nucle- 
osomal ubiquitylation and slightly increases nucleosomal binding affinity. 
We do not fully understand this phenomenon but it may relate to per- 
turbation of the E3 heterodimer given the location of this side chain at 
the Ring1B-Bmil interface. Overall, our studies suggest a dominant role 
for Ring1B in the nucleosome binding of PRC1. The 25-fold enhancement 
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of Ring1B activity by Bmil (refs 9, 20) probably represents the modest 
role for Bmil in NCP binding together with a role in constraining 
ubiquitin at the Bmil-ubiquitin interface’*'’. 

In human PRC1 complexes, Bmil can be replaced by five paralogues 
also known as PCGF proteins’’. The sequence conservation of these 
paralogues combined with our structural and biochemical data suggest 
that each PCGF protein RING domain could support H2A ubiquityla- 
tion (Extended Data Fig. 8a). Bmil Glu 33 is absolutely conserved mirror- 
ing its importance for nucleosomal ubiquitylation. Moreover, Thr 63 
and Arg 64 show sequence similarity, maintaining both overall side- 
chain size and hydrophilicity. Meanwhile, Lys 62, which is dispensable 
for nucleosomal ubiquitylation, exhibits far less conservation and is 
replaced by negatively charged side chains in two of the paralogues. The 
lack of H2A ubiquitylation by canonical PRC1 complexes may reflect 
masking of the E3 interfaces with either the nucleosome or its E2. Alter- 
natively, the nucleosome surface surrounding H2A Lys 119 may become 
inaccessible, secondary to canonical PRC1-mediated chromatin com- 
paction. Further structural characterization of canonical and variant 
PRC1 alone and in complex with oligonucleosomes may shed light on 
this phenomenon. 


UbcH5c-nucleosomal DNA interactions 

Nucleosomal binding by the Ring1B-Bmil E3 ligase minimal RING 
heterodimer positions the E2 UbcH5c with its active site directly over the 
H2A C-terminal tail near the target Lys 119 (Fig. 1c). However, UbcH5c 
scarcely contacts the underlying nucleosomal histone surface. The lack 
of E2 active-site-substrate specific interactions may allow UbcH5c to 
efficiently ubiquitylate several substrates by pairing with different E3 
ligases, although we cannot rule out nucleosome-specific contacts by 
other potential E2s. This contrasts the direct recognition of the target 
lysine and surrounding consensus sequence by the related E2 Ubc9 in 
the SUMO-RanGAP1-Ubc9-Nup358 E2-E3 product crystal structure”. 
On the other hand, UbcH5c does interact with the nucleosome using two 
non-active site surfaces to bind nucleosomal DNA near the DNA end 
between superhelical location (SHL) 6 and 7 and at the nucleosomal dyad 
(Fig. 3a, b and Extended Data Fig. 3l-s). These interfaces vary consid- 
erably on the proximal and distal sides of the crystal structure. At the 
DNA end interface, the UbcH5c antiparallel sheet aligns the side chains 
of His 32, His 55 and Lys 66, directing them towards the DNA backbone 
(Fig. 3a). This allows Lys 66 and His 32 to form hydrogen bonds with 


Figure 3 | UbcH5c binds to nucleosomal DNA enhancing Ring1B-Bmil- 
NCP affinity. a, UbcH5c antiparallel $-sheet-DNA end interactions. 
Important phosphates are coloured pink. The catalytic UbcH5c Cys 85 is 
shown. b, UbcH5c-dyad interactions. The nucleosome dyad nucleotide is 
coloured purple. Arg 72 and Lys 128 side chains not modelled owing to limited 
electron density. c, Fluorescence quenching NCP binding curves for E3 
Ring1B-Bmil alone (grey), fused to E2 UbcH5c (black), or with E2 UbcH5c 
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adjacent DNA phosphate groups on the proximal side of the structure. 

His 55 and Thr 58 side chains also approach the neighbouring phos- 
phodiester backbone (3.6 and 3.9 A, respectively). On the distal side of 
the crystal structure, each of these residues is ~2 A farther from the 
nucleosome surface, limiting direct contact to the Lys 66 side chain that 
lies 4.1 A from the DNA backbone (Extended Data Fig. 3n, 0). In addi- 
tion to these backbone interactions, Asn 81 projects into the major groove 
at the DNA end, although it is unclear in our structure whether base 
specific recognition occurs. 

UbcH5c also contacts the nucleosomal DNA again as its «3 helix 
tracks across the nucleosomal dyad (Fig. 3b). On the proximal side, 
the UbcH5c Arg 125 side chain forms a hydrogen bond with the phos- 
phate in position 1. Although the weaker density in this region of the 
structure prevents modelling of the Lys 128 side chain, the Lys 128 CB 
is only 3.9 A from the phosphate backbone at the dyad, allowing for 
further contacts. A ~3 A translation of the distal side UbcH5 «3 helix 
along its axis results in a minor rearrangement of the distal interface 
(Extended Data Fig. 3r, s). Notably, a hydrogen bond is observed between 
the Lys 128 side chain and the dyad phosphate on the distal side. 

Given the novelty of this E2-substrate interaction, we further exam- 
ined the role of UbcH5c in nucleosomal binding by the PRC1 ubiqui- 
tylation module in solution. We initially noted that the fused PRC1 
complex binds to the NCP with nearly tenfold increased affinity as 
compared to Ring] B-Bmil alone (Fig. 3c and Supplementary Table 2). 
Titration of UbcH5c in trans incrementally increases the affinity of the 
Ring1B-Bmil-NCP interaction, verifying contributions by UbcH5c to 
nucleosomal binding. At 25 uM UbcH5c (~2.5X the E2-E3 dissocia- 
tion constant (Kg) under assay conditions”), the near-saturated, unfused 
PRC1 ubiquitylation module mimics its fused counterpart both in nucle- 
osome affinity and raw fluorescence change, further validating the use 
of the fused complex in the structure and to interrogate effects of muta- 
genesis of all three PRC1 ubiquitylation module components (Fig. 3c 
and data not shown). Importantly, the Ring] B Asp56Lys mutation that 
prevents Ring] B~UbcH5c binding” eliminates affinity changes medi- 
ated by UbcH5c in trans (Supplementary Table 2). 

Having established a role for UbcH5c in nucleosome binding, we 
next addressed the contributions from the DNA end and DNA dyad 
interfaces. The UbcH5c(Lys66Glu) charge reversal mutation decreased 
nucleosomal affinity and ubiquitylation approximately tenfold (Extended 
Data Fig. 7g-i and Supplementary Table 2). A similar trend was observed 


c Kg (uM) 
E3 2.1 +03 
©E3 + 2.5 uM E2 10 +02 
°©E3+5 uM E2 0.50 + 0.03 
eE3 + 12.5 uM E2 0.33 + 0.02 
© £3 +25 uM E2 0.23 + 0.02 
+ 


e Fused E2-E3 
1.2) nE2 ND 


Normalized fluorescence change 


0.01 


0.10 1.0 10 
Concentration (uM) 


added in trans (coloured as shown). E2 UbcH5c alone (open squares) is 
undetectable under assay conditions as it either fails to bind to the NCP or is 
undetectable owing to the location of the fluorescent probe used to monitor 
Ring1B-Bmil binding. Data are mean and s.d. for each data point, n = 3. 
Fluorescence is normalized to fit values for unbound and saturated NCP. 
Concentrations depicted using log scale. 
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with the UbcH5c(Arg125Glu/Lys128Glu) double mutation, although 
the nucleosomal affinity of this mutant complex could not be calculated 
owing to aggregation of the complex at high concentrations under assay 
conditions. The UbcH5c(Thr58Glu) mutant also exhibited significant 
although less severe defects in binding and ubiquitylation, whereas the 
Arg136Glu/Arg139Glu negative control mutant away from the NCP 
surface exhibited wild-type activity. Altogether, these results implicate 
both DNA end and DNA dyad interfaces in PRC1 function. 

The fused PRC1 ubiquitylation module with the UbcH5c(Lys66Glu) 
charge reversal mutant bound less tightly to the nucleosome than the 
Ring1B-Bmil subcomplex alone, suggesting a repulsive effect in addi- 
tion to loss of any native interactions. Analysis of alanine mutations pro- 
vides evidence for the role of UbcH5c side chains in nucleosome binding. 
At the DNA end interface, the UbcH5c(Lys66Ala) mutation caused a 
50% reduction in nucleosome ubiquitylation and a sixfold decrease in 
nucleosome affinity (Extended Data Fig. 7j-1and Supplementary Table 2). 
The neighbouring Arg72Ala and His32Ala/His55Ala mutations also 
showed measurable but more muted effects relative to wild-type pro- 
teins. At the dyad interface, the Arg125Ala/Lys128Ala double mutant 
mimicked the Lys66Ala mutant with a 60% and sixfold reduction in 
nucleosome ubiquitylation and binding, respectively. Individual alanine 
mutations of Arg 125 and Lys 128 caused nearly identical intermediate 
effects. Unlike the charge reversal mutations, all fused complexes bear- 
ing UbcH5c alanine mutations bound to the nucleosome with affinities 
weaker than that of the fused wild-type complex but stronger than the 
Ring1B-Bmil subcomplex alone, implicating the side chains in nucle- 
osome binding. All UbcH5c mutants have similar intrinsic activity as 
measured by lysine aminolysis of ubiquitin-charged E2s”’ (Extended 
Data Fig. 2c). 

Because the variation between the proximal and distal sides of the 
structure is greatest at the UbcH5c-DNA end interface, our functional 
studies present an opportunity to assess the relative authenticity of the 
two sides. For example, the UbcH5c His 32 and His 55 side chains are 
3.3 and 3.6 A from the DNA backbone on the proximal side, and 5.1 
and 7.1 A on the distal side, respectively. Mutation of both histidines to 
alanine decreases nucleosomal affinity 1.5-fold while causing a 30% reduc- 
tion in enzymatic activity. Because the histidine-DNA backbone dis- 
tances on the proximal side of the structure are more consistent with 
our biochemical studies, we favour this side as the better representation 
of the solution state interaction. 

Further inspection of the structure suggests potential for additional 
interactions between UbcH5c and nucleosomal linker DNA (Extended 
Data Fig. 9a, b). The UbcH5c C-terminal «4 helix rests across the major 
groove of linear B-form DNA modelled onto the end of the NCP. The 


Figure 4 | Model of ubiquitylation transition state complex. a, View of 
transition state model looking down on the DNA superhelical axis with region 
around catalytic site highlighted with black circle. Ubiquitin modelled from the 
RNF4-UbcH5a-Ub"* (PDB accession 4AP4) structure on the basis of the 
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DNA-adjacent helical face includes two positively charged and two aro- 
matic side chains. Consistent with this hypothesis, we find that the PRC1 
ubiquitylation module binds to nucleosomes centred on 185 bp 601 
DNA with twice the affinity of those reconstituted with 147 bp 601 DNA 
(Extended Data Fig. 9c). The absence of linker DNA required for these 
interactions in our crystals may explain the apparent flexibility of the 
a4 helices suggested by weaker electron density and higher B-factors. 

In addition to UbcH5c, the Ring1 B-Bmil E3 ligase can also pair with 
UbcH5a (also known as UBE2D1), UbcH5b (UBE2D2) and UbcH6 
(UBE2E1)”°. Meanwhile five other E2s cannot substitute for UbcH5c. 
These functional classes could result from compatibility differences at 
the E2-NCP and/or E2-E3 interfaces. Sequence alignment of E2s sug- 
gests the possibility that multiple positions including Arg 125 combine 
to determine PRC1 complementarity (Extended Data Fig. 8b). 


Nucleosome recognition by BRCA1 

Despite contributions from Bmil and UbcH5c, Ring1B is the major 
contributor to nucleosome recognition as evidenced by buried surface 
area calculations and the magnitude of effects of Ring1B mutations on 
nucleosome binding and ubiquitylation. We wondered whether another 
E3 ligase, BRCA1, recently shown to target nucleosomal H2A Lys 127 
and Lys 129 (ref. 34) specifically, might use a similar nucleosome rec- 
ognition pattern. Much like Ring1B, BRCA1 uses its N-terminal RING 
domain to pair with the BARD1 RING domain to form a heterodimeric 
RING-type E3 ligase”’. Structural and sequence alignment shows a high 
level of conservation between the nucleosome-binding loop of Ring1B 
and the corresponding BRCA1 region, including the Lys-Arg motif 
(Extended Data Fig. 10a, b). We find that disruption of this motif with 
the Lys70Ala/Arg71Ala double mutation eliminates nucleosomal ubi- 
quitylation by the BRCA1-BARD1 RING heterodimer (Extended Data 
Fig. 10c, d). A similar loss of activity was observed after mutation of 
the H2A-H2B acidic patch (Extended Data Fig. 5b). This suggests the 
possibility that like Ring1B, BRCA1 uses basic residues in the putative 
nucleosome-binding loop to bind to the nucleosomal H2A-H2B acidic 
patch. However, as Ring1B and BRCA1 target different H2A residues 
(Lys 119 versus Lys 127/Lys 129), additional BRCA1-BARD1 contacts 
may alter its orientation on the nucleosome. 


Nucleosomal H2A specificity 


The PRC1 ubiquitylation module requires a nucleosome substrate, show- 
ing no activity on an H2A-H2B dimer. Our structure indicates that 
Ring1B, Bmil and UbcH5c interact with nucleosomal H2A-H2B, H3- 
H4 and DNA, respectively, providing a mechanistic basis for nucleo- 
some specificity. We also believe that the nucleosome has an additional 
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alignment of its E2 subunit and UbcH5c from the proximal half of the 
PRC1 ubiquitylation module-NCP structure. b, Side view of tetrahedral 
intermediate model. 
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role in presenting the H2A C-terminal tail to the UbcH5c active site. In 
the context of the H2A—H2B dimer, the H2A C-terminal tail is expected 
to be unstructured beyond residue 105 assigning an entropic cost to 
ubiquitylation of Lys 119 (ref. 36). However, within the nucleosome, H2A 
adopts an elongated structure between residues 105 and 117, resting on 
a platform generated by the underlying H3 surface*". Residues beyond 
this structured region are more flexible, as suggested by the inability to 
model beyond Lys 118 in many NCP crystal structures***°. As such, 
the nucleosome both constrains the H2A C-terminal tail trajectory while 
allowing localized flexibility to reach the UbcH5c active site. 

Despite the large number of lysine residues in the nucleosome, the 
PRC1 ubiquitylation module is specific for H2A Lys 119 and toa lesser 
extent Lys 118. In our structure, electron density is weak after Pro 117 
allowing only main-chain atoms to be built to Lys 119 and Lys 118 on 
the proximal and distal sides, respectively. Neither main-chain trajec- 
tory indicates binding to the UbcH5c active site cleft. To verify that the 
UbcH5c positions within our structure are compatible with catalysis, 
we modelled the tetrahedral ubiquitylation transition state using H2A 
Lys 118 and 119 on each side of the structure (model for proximal side 
Lys 119 shown in Fig. 4). Aside from the tetrahedral centre, ubiquitin 
makes no contact with the nucleosome surface in these models. This echoes 
the lack of SUMO-RanGAP1 interactions in the SUMO-RanGAP1- 
Ubc9-Nup358 E2-E3 product structure’. We find the position of 
UbcH5c on both sides of the structure is consistent with H2A Lys 118 
and Lys 119 ubiquitylation. Moreover, our structure suggests nucleo- 
some ubiquitylation is limited to these two lysines because UbcH5c is 
constrained in close proximity to the nucleosome surface, which pre- 
vents other lysines from entering the active site. Overall, the multifaceted 
interaction of the PRC1 ubiquitylation module with the nucleosome 
and the nucleosomal organization of the H2A C-terminal tail seem to 
combine to effect H2A ubiquitylation. More generally, our structure 
illustrates how a chromatin enzyme can use several non-active site sur- 
faces for nucleosome recognition and specificity determination. We 
anticipate this principle to be shared by many chromatin enzymes that, 
like PRC1, function as part of large multi-component complexes. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Preparation of proteins, protein complexes and NCPs. Complete genes for 
human RING1B (also known as RNF2), BMI1, UbcH5C (UBE2D3) and UBA1 were 
amplified from HeLa complementary DNA. Gene fragments of BRCA1 and BARD1 
containing the RING domains were amplified from HeLa and U2OS cDNA, respec- 
tively. The gene for ubiquitin was synthesized by Invitrogen. UbcH5c(2-147), 
Ubal(2-1058) and ubiquitin(1-76) were each cloned into pST50Tr vectors with 
an N-terminal tandem Strep peptide (STR)-decahistidine (His, 9) affinity tag. STR- 
His,9-Ring1B(2-116)-Bmil (2-109) and STR-His10-BRCA1 (2-103)-BARD1(27- 
130) were each cloned into pST44 vectors for polycistronic co-expression’. A 
similar pST44 vector was created to co-express Bmil(2-109) with Ring1B(2-116) 
fused in frame to UbcH5c(2-147) through a linker encoding the protein sequence, 
GSGSRS. Point mutants were generated by site-directed mutagenesis. The fused 
human PRC1 ubiquitylation module containing Bmil(2-109) and STR-Hisjo- 
Ring1B(2-116)-GSGSRS-UbcH5c(2-157) was expressed in Rosetta(DE3)pLysS 
Escherichia coli at 18 °C. The complex was enriched by metal-affinity chromato- 
graphy with Talon resin (Clonetech), the affinity tag removed with tobacco etch 
virus (TEV) protease and the complex further purified by SourceS cation-exchange 
chromatography (GE Healthcare). The Ring1B(2-116)-Bmil(2-109) complex and 
UbcH5c(2-147) single protein were expressed and purified similarly. STR-His,9- 
Uba1(2-1058) and STR-His)o-ubiquitin(1-76) were expressed in CodonPlus(DE3) 
and BL21(DE3)pLysS E. coli at 18 °C and 37 °C, respectively. Ubal was purified 
by metal-affinity chromatography as above, followed by SourceQ anion-exchange 
chromatography (GE Healthcare) without cleavage of the affinity tag. Ubiquitin 
was purified similarly with the affinity tag intact using SourceS cation-exchange 
chromatography. The STR-Hisj -BRCA1(2-103)-BARD1(27-130) complex was 
expressed in Rosetta(DE3)pLyss E. coli at 18 °C and purified by metal-affinity chro- 
matography, TEV cleavage, and SourceQ anion-exchange chromatography. All 
mutants were expressed and purified identically to their wild-type versions includ- 
ing ion-exchange chromatography. Histone mutants were generated by site-directed 
mutagenesis. Recombinant Xenopus histones were expressed, purified and recon- 
stituted with the 147- or 185-bp Widom 601 sequence* essentially as described 
previously” including anion-exchange purification of NCPs. Fluorescent labelling 
of histones was performed as described*". 

Characterization of proteins and protein complexes. All proteins and complexes 
were evaluated by dynamic light scattering using a Protein Solutions DynaPro instru- 
ment. Scattering was analysed using the Dynamics v6 software package. All E2 and 
E3 mutants showed equivalent monodispersity to their wild-type controls (poly- 
dispersity <20%). Owing to paradoxical binding and activity results, the secondary 
structure of Bmi(Glu33Ala) mutant E3 heterodimer and fused E2-E3 complexes 
were further analysed and compared to wild-type controls using a Jasco J-1500 
Circular Dichroism Spectrometer. Spectra were obtained at 0.1 mg ml’ protein 
concentration in 2mM Tris-Cl, pH 7.6, containing 150 mM NaF. Data were ana- 
lysed using CDPro. 

Complex reconstitution and crystallization. The PRC1 ubiquitylation module 
was reconstituted with NCPs prepared with 147-bp 601 DNA ata 2.8-3.0:1 module: 
nucleosome ratio in 20 mM Tris-Cl, pH 7.6, 75 mM NaCl, 10 1M ZnSO,, 1mM 
dithiothreitol (DTT). The complex was purified by Superdex 200 size exclusion 
chromatography (GE Healthcare) in reconstitution buffer supplemented with 0.2 mM 
phenylmethylsulphony] fluoride (PMSF). Pooled fractions were concentrated to 
~10mgml * by VS500 centrifuge ultrafiltration (Vivascience). Crystallization 
was performed in modified microbatch** by combining 1 1] of the concentrated 
complex with 1 pl 25 mM HEPES, pH 7.5 (Hampton), 80 mM NH,NO; (Hampton), 
and 3% PEG 2000-MME (Fluka) at 21 °C overlayed by 70 kl Al’s oil (1:1 mixture of 
silicon oil (Clearco) and mineral oil (Fisher)). Crystals were collected after 12-60 days. 
Cryoprotection and dehydration was performed by soaking in 25 mM HEPES, 
pH7.5, 120 mM NH4NOs, 2.5% ethanol, 0.5 mM DTT, and 5% PEG 2000-MME 
with increasing amounts of PEG 550-MME (Fluka, 0-28% in 4% increments, every 
10-15 min) at room temperature before flash cooling in liquid nitrogen. 

Data collection, data processing, model building and refinement. Diffraction data 
were collected at Advanced Photon Source’s NE-CAT beamline 24-ID-E (0.9792 A; 
100 K). Data were processed using XDS"! and Scala’. The structure was solved by 
molecular replacement with Phaser** and polyalanine versions of the PRC1 ubiqui- 
tylation module” (PDB code 3RPG) and the NCP with 145-bp Widom 601 DNA” 
(PDB code 3LZ0) as rigid body search models (Phaser final log likelihood gain 
3,700). Crystallographic refinement was performed with iterative automatic refine- 
ment using REFMAC5“**°, PHENIX**, and CNS” together with manual model 
building in COOT™. The structure was refined to 3.3 A resolution with Ryork/Riree 
19.7%/24.5%. The asymmetric unit contains one copy of the 2:1 PRC1 ubiquityla- 
tion module-NCP complex in C222, space group. The final model includes Bmil 
residues 3-107 for chain K, residues 6-102 for chain M; Ring1B residues 15-116 
for chain L, 16-116 for chain N; UbcH5c residues 2-147 for chain L, 2-147 for chain 
N (residues for UbcH5c are offset by 200 in the PDB file (that is, 202-347), linker 
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between Ring1B and UbcH5c in final model residues 117-122 for chains L and N); 
H3 residues 37-134 for chain A, residues 38-134 for chain E; H4 residues 17-102 
for chain B, residues 21-102 for chain F; H2A residues 16-119 for chain C, residues 
12-118 for chain G; H2B residues 31-125 for chain D, residues 31-125 for chain H; 
601 DNA residues —71 to +72 for chain I, residues —72 to +72 for chain J. The 
histone tails are typically not observed in NCP structures. Above average B-factors 
are typical for structures containing the NCP especially at resolutions greater than 
3 A*°-5!_ Ramachandran statistics were analysed with PROCHECK™ (88.4% of 
residues in most favoured regions, 11.5% in additional allowed regions, 0.1% in 
generously allowed regions, 0.1% in disallowed regions — disallowed residue cor- 
responds to Ser 122 to in L chain contained in fusion between Ring1B and UbcH5c). 
Simulated annealing omits maps cumulatively covering the entire structure were 
generated with CNS. All molecular graphics were prepared with PYMOL”. Buried 
surface areas and r.m.s.d. values for structure alignments were calculated using 
PyMOL. 

Ubiquitylation assays. Ubiquitylation assays were performed by mixing 2 1M NCP, 
5 uM STR-His;o-ubiquitin, 30 nM STR-Hisj9-Ubal (E1), 0.375 1M UbcH5c (E2) 
and 0.375 pM Ring1B(2-116)-Bmil (2-109) or BRCA1(2-103)-BARD1 (27-130) 
(E3) in 50 mM HEPES, pH 7.5, 75mM NaCl, 1mM DTT, 10 tM ZnSO,, 2mM 
MgCl, 3 mM ATP. For histone mutants assays, non-fluorescent nucleosomes with 
an STR-His¢-tagged H2B were used. For PRC1 ubiquitylation module mutants, 
wild-type recombinant nucleosomes were doped with 5% nucleosomes containing 
the H2A(Thr10Cys) mutant labelled with Oregon Green 488 maleimide. Assay 
samples ranging from 20 to 80 ull were placed in a water bath for 60 min at 30 °C 
after which assays were quenched with an equivalent volume of 2 SDS-PAGE gel 
loading dye and boiled for 5 min. Assay samples were separated by SDS-PAGE 
using 18% gels, scanned with a Typhoon 9410 imager (GE Healthcare; 488 nm exci- 
tation; 526 nm emission; PMT 500) and stained with Coomassie brilliant blue. 
Quantification of assays was performed using Image Quant software by drawing 
boxes around each band. Each quantified band was background subtracted using 
an identically sized box in a region of the same lane without signal. As only unmod- 
ified H2A, H2Aub1, H2Aub2 and H2Aub3 were observed, the sum of these back- 
ground-subtracted bands was considered to be the total H2A for each lane. In this 
manner, fraction of total H2A was calculated for each band. These fractional values 
could be compared across gels reproducibly (Extended Data Fig. 6). Three tech- 
nical replicates were performed for each mutant with all mutants of a given type 
(that is, all Ring1B mutants) executed simultaneously with both positive and nega- 
tive controls. In most cases, three gels were run each containing one of the three 
replicates of each of the mutants in a given data set. n = 3 was found to be sufficient 
for reproducible data (Extended Data Fig. 6) with low standard deviations. Assays 
for El-mediated transfer of ubiquitin to UbcH5c Cys85Lys mutants were per- 
formed as previously described’®. In brief, 100 1M STR-His;9-ubiquitin, 1 1M 
STR-His}9-Ubal (E1), 80 1M UbcH5c or fused PRC1 ubiquitylation module with 
the Cys85Lys mutant were combined in 50 mM Tris-Cl, pH 10, 150 mM NaCl, 
8mM TCEP, 5 mM MgCl2 and incubated at 35 °C for 24h. Assay samples were 
separated by SDS-PAGE and stained as described above. 

UbcH5c aminolysis assays. To assess intrinsic function of previously uncharac- 
terized UbcH5c mutants, lysine reactivity aminolysis assays were performed basi- 
cally as previously described”. In brief, 20 1M UbcH5c (wild-type or mutant) was 
combined with 20 uM STR-Hisj9-ubiquitin and 0.5 14M STR-His;9-Ubal (E1) in 
80 pl of 25mM sodium phosphate, pH 7.0, 150mM NaCl, 10mM MgCl, and 
10 mM ATP. Following a 20-min incubation at 37 °C, lysine was added to a final 
concentration of 50 mM. Aminolysis time points were taken at 0, 10 and 30 min 
and analysed by SDS-PAGE. Reduction of the no-lysine control was performed 
with 100 mM DTT. 

Nucleosome binding assays. Nucleosome binding assays were performed with 
nucleosomes reconstituted with a H2B(Ser112Cys) mutant labelled with Oregon 
Green 488 maleimide (Life Technologies) with approximately ~50% efficiency and 
147- or 185-bp 601 DNA. Fluorescence quenching assays were performed essen- 
tially as previously described*'. In brief, chromatin binding proteins (wild-type or 
mutant-fused PRC1 ubiquitylation module or Ring] B-Bmil and/or UbcH5c) were 
diluted in 20 mM Tris-Cl, pH 7.6, 100 mM NaCl, 5 mM DTT, 5% glycerol, 0.01% 
NP-40, 0.01% CHAPS, 0.1 mg ml | BSA to concentrations of 50, 32, 16, 8, 4, 2, 1, 
0.5, 0.25, 0.125, 0.062, 0.031 and 0.016 uM. The labelled nucleosome was diluted to 
2 nM in identical buffer containing 50 mM NaCl. Twenty microlitres of NCP was 
added to 20 kl of each protein/complex dilution in triplicate in a passivated 384- 
well plate (Greiner). Following centrifugation, mixing and 10-20-min incubation, 
plates were scanned with a Typhoon 9410 imager (488 nm excitation; 526 nm emis- 
sion; PMT 600). Fluorescence was quantitated with ImageQuant software and data 
was analysed with ProFit (QuantumSoft). For E2 titration experiments, UbcH5c 
was added at twice the indicated concentration to each of the Ring] B-Bmil dilu- 
tions, giving the final indicated concentration following 1:1 mixing with the fluo- 
rescent NCP. n = 3 was found to be sufficient for reproducible data (Extended 
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Data Fig. 6) with low standard deviations. One data point of all the data points 
measured was rejected owing to obvious defects in the microplate well. The fused 
PRC1 ubiquitylation module was most appropriate for measuring affinity constants 
for mutants of all three components because the higher affinity of the fused complex 
(relative to E2 or E3 alone) enabled a larger dynamic range for the assay. This allowed 
the scope of a mutation’s effect to be measured and compared to all other mutants. 
BRCA1-BARD1/Ring1B-Bmil alignment. The BRCA1-BARD1/Ring1B-Bmil 
RING heterodimer alignment was performed using the PYMOL pair_fit command 
with Ca atoms of BRCA1 residues 22-55 and 60-76 and the corresponding Co 
atoms of Ring1B residues 49-79 and 86-102. The alignment gave a r.m.s.d. value of 
2.69 A. The BRCA1 loop containing residues 53-59 (corresponding to Ring1B loop 
residues 80-85) was excluded owing to length discrepancy. Divergent regions adja- 
cent to the RING domains were also excluded. 

Tetrahedral intermediate model building. To construct the tetrahedral model 
for the ubiquitylation intermediate, an initial model for ubiquitin was prepared by 
aligning UbcH5a from the RNF4—UbcH5a-ubiquitin structure’® (PDB code 4AP4) 
with UbcH5c in the PRC1 ubiquitylation module-NCP structure. Side chains were 
built for H2A Lys 118 and Lys 119 allowing the Lys 119 NZ atom to be placed in 
proximity to the adjacent UbcH5c Cys 85 catalytic residue using COOT. A restraint 
library was constructed using JLigand™ converting ubiquitin Gly 76 to ethanola- 
mine and linking its CB atom to both the NZ atom of H2A Lys 119 and the SG atom 
of UbcH5c Cys 85. The geometry of the model was idealized using REFMACS. This 
was repeated for the proximal and distal sides of the structure for H2A Lys 118 and 
Lys 119. Each model resulted in minimal perturbation of the PRC1 ubiquitylation 
module structure outside of H2A Lys 118 and Lys 119 residues. Furthermore, each 
model allows addition of H2A residues C-terminal to Lys 119 without steric clash. 
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Backbone alignment to 
histones in 601 NCP (3LZ0) 


Backbone alignment to 
Ring1B/Bmi1 from 3RPG 


Backbone alignment to 
UbcH5c from 3RPG 


Backbone alignment to PRC1 ub module 
(3RPG) 


UbcH5c 


UbcH5c 


4 Ring1B 


NCP interacting loops 


€ Deviation from apo-structures 


protein(s) rmsd (A)* 
PRC1 ub module 0.98/1.06" 
Ring1B/Bmi1 0.72/0.73t 
UbcH5c 0.98/0.58" 
histones 0.69 


* calculated over all backbone atoms 
t proximal side/distal side 


Bmit 


j Deviation between two halves 


j Solvent accessible buried surface area 


protein(s) rmsd (A)* interface proximal distal 

histones 0.49 Ring1B-NCP 695 A? 565 A2 
Ring1B 1.58 Bmi1-NCP 329 A? 261 A? 
Bmi1 2.11 UbcH5c-NCP 462 A? 247 A2 
UbcH5c 4.56 Ring1B/Bmi1-NCP 1005 A? 810 A? 
Ring1B/Bmi1 1.86 Ring1B/Bmi1/UbcH5c-NCP 1467 A? 1057 A? 


* calculated over all backbone atoms 


Extended Data Figure 1 | Alignment of PRC1 ubiquitylation module and 
NCP with previously determined sub-structures and comparison of the two 
halves of the structure. a, Alignment of PRC1 ubiquitylation model from the 
proximal (darker colours) and distal (lighter colours) halves of the PRC1 
ubiquitylation module-NCP structure with the previously determined 
structure of the PRC1 ubiquitylation module alone” (grey, PDB code 3RPG) 
using all backbone atoms. b, c, A similar alignment using the Ring] B-Bmil 
subcomplex (b) or UbcH5c only (c). d, Alignment of the NCP from the PRC1 
ubiquitylation module-NCP core particle complex (coloured) and the 
previously determined NCP complex” (PDB code 3LZ0) containing the 
identical 601 nucleosome positioning sequence. e, R.m.s.d. for each of the 
alignments calculated over all backbone atoms. f, g, Orthogonal views of an 


alignment of two copies of the complex following rotation of one copy about the 
pseudo-two-fold axis of symmetry of the nucleosome. This was accomplished 
by simultaneously aligning each histone in one copy to its symmetry related 
counterpart in the other copy. For visualization, the PRC1 ubiquitylation 
module from one copy of the structure is depicted with lighter colours. 

h, Zoomed view of the alignment showing PRC1 ubiquitylation module-NCP 
interface. The proximal and distal PRC1 ubiquitylation modules are shown in 
dark and light colours, respectively. Hinge indicated by arrows. i, R.m.s.d. 

for each of the components following alignment of the two halves as described 
above. j, Buried surface area calculations for the indicated interfaces on the 
proximal and distal sides of the structure. 
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a E1-mediated ubiquitin transfer to UbcH5c 
alone but not to fused PRC1 module 
db 
ESS 
$ 
¥ xX 
SF SE 
=~) We 
ATP bat -ub 
67- *Uba1 
45- 

. ~—Expected position of Ring1B-UbcH5c*-ub 
31- +Ring1B-UbcH5c* 
04. *UbcH5c#-ub 

«UbcH5c* 
14- 


eE—— sub/Bmi1 


12 3 4 
*C85K mutant of UbcH5c active cite Cys 


Steric clash of Ring1B and ubiquitin 
bound to the E1 adenylation site 


c 
Intrisinic activity of E2 mutants 
R125E+ R136E+ H32A+ R125A+ 
wild-type _ wild-type = 1T58D K66E K128E  R139E H55A K66A R72A R125A K128A — K128A 
sie -- = Fe CHS GES FHKE Re Se Fee PS Se SS SS SS SF 
min 0 10 30 30'_ 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 0 10 30 
UbcH5c-ub= 
=o 
UbcH5c- 
ub- nga ¢ — -14 


* non-reducible product 
+100 mM DTT added following 30 min incubation without lysine 


Extended Data Figure 2 | Characterization of fused and unfused 

UbcH5c. a, Coomassie-stained gel of El-mediated ubiquitin transfer to 
UbcH5c(Cys85Lys) mutant (UbcH5c*) alone and in the context of the fused 
PRC1 ubiquitylation module. Ubiquitin is transferred by the El Ubal to the 
UbcH5c mutant in an ATP-dependent manner, black arrow (lanes 1 and 2). 
No ubiquitin transfer to the fused PRC1 ubiquitylation module is observed 
(lanes 3 and 4, expected band position for Ring1B-UbcH5c-ubiquitin 
conjugate indicated by a blue arrow). The Cys85Lys mutant was used to assist in 
visualization of the UbcH5c-ubiquitin stable isopeptide conjugates. b, Ring1B 


of the fused PRC1 ubiquitylation module (blue) clashes with ubiquitin in the E1 
Ubal adenylation site (green). Alignment of UbH5c subunit of PRC1 
ubiquitylation module” (PDB code 3RPG) to Ubc4 in Ubc4-Uba1-ubiquitin 
structure” (PDB code 4II2). Ubc4 and Ubal are shown in pink and grey, 
respectively. ¢, Intrisic activity of wild-type and mutant UbcH5c enzymes. 
Time-course of UbcH5c-ubiquitin thioester aminolysis after lysine addition. 
Reactions quenched with non-reducing loading buffer unless 100 mM DTT 
addition is indicated (dagger). Asterisk denotes a non-reducible product. 
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Ring1B binding 


RingiB 
K85,< 


“proximal proximal 
Extended Data Figure 3 | Comparison of the proximal and distal Ring1B-, 
Bmil- and UbcH5c-nucleosome interfaces. a—c, Identical views of 
Ring1B-Bmil saddle from proximal (a) and distal (b) halves of structure and 
overlay (c). d-g, Ring1B-histone interface views, including zoom out (d) with 
box indicating field of view depicted in magnified panels from proximal (e) 
and distal (f) halves of structure and overlay (g). h-k, Similar views of 
Bmil-histone interface. l-s, Similar views of UbcH5c-DNA interfaces. 
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dyad binding 


proximal 


distal 


overlay 


Ring1B im 
K85 39 


distal distal 


t-u, 2mF,, — DF, difference Fourier transform electron density maps for the 
Ring1B-nucleosome interface contoured at 1.0¢ on the proximal (t) and 
distal (u) sides of the PRC1 ubiquitylation module-NCP structure. 

v, w, Similarly contoured electron density map for the proximal (v) and 
distal (w) Bmil-nucleosome interface. All views are identical to those depicted 
in Figs 2 and 3. 
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CENP-C 


Extended Data Figure 4 | Chromatin factors use an arginine-anchor to bind — code 3TU4; ref. 29) (e) each bound to the acidic patch. Conserved arginine is 
the H2A-H2B acidic patch. a-e, Identical views of E3 ligase Ring1B (blue) shown in space-filling representation. Other arginines involved in binding are 
(a), viral LANA peptide (purple, PDB code 1ZLA; ref. 27) (b), chromatin factor shown as sticks. UbcH5c and the Ring1B loop between residues 81 and 98 
RCC1 (purple, PDB code 3MVD; ref. 28) (c), centromeric protein CENP-C are not shown for figure clarity. 

(green, PDB code 4INM; ref. 30) (d) and silencing protein Sir3 (brown, PDB 
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a b 
Nucleosome ubiquitylation by Ring1B/Bmi1 Nucl. ubiquitylation by BRCA1/BARD1 
\ a \e 
s s 
So fe 
Sy ss 
SK Fe vy 
gowye a ww 
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Q 6 © DGD © A © 
- eee é 
A 
es vVvVvVve 
45- —-H2Aub2 —H2Aub2 
31- -H2Aub1 -H2Aub1 
24. ri ead —— | giegued-H2p 
14 *H2A/UbcH5c “| *H2A/UbcH5c 
| *H4/ub/Bmi1/Ring1B +H4/ub/BRCA1/BARD1 
423 4°55 6 7 8 9 10: 11 72 
Extended Data Figure 5 | Ring1B-Bmil- and BRCA1-BARD1-mediated ubiquitin and Ring1B-Bmil (a) or BRCA1-BARD1 RING heterodimers (b). 
ubiquitylation of nucleosomes containing histone mutations. Tagged H2B (STR-Hisg) was used to prevent electrophoretic comigration with 


a, b, Coomassie-stained gels of ubiquitylation assays using nucleosomes with | unmodified H2A. Experiment was repeated twice. 
the specified histone mutants, E1 Ubal, E2 UbcH5c, STR-Hisy9 tagged 
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Fluorescence-based nucleosome ubiquitylation assay 


H2A (unmod.) 
! H2Aub3 
Hi H2Aub2 
WB H2Aub1 


Extended Data Figure 6 | Fluorescence-based nucleosome ubiquitylation 
and binding assays. a, Fluorescence-based nucleosome ubiquitylation assay. 
Left, ubiquitylation assays performed with nucleosomes labelled with Oregon 
Green 488 maleimide on H2A(Thr10Cys) mutant (H2A T10C-OG488). 
Right, replicates 1, 2 and 3 of wild-type assay run on four different gels and 
quantified to demonstrate reproducibility across gels. Data are mean and s.d., 
n= 4. b, Fluorescence-based nucleosome binding assay. Top left, nucleosomes 


b Fluorescence quenching nucleosome binding assay 
—PRC1 = not quenched 


+ PRC1 = quenched 


Kd (uM) 
0.23 + 0.02 
0.234 + 0.008 


0.21 + 0.01 


Normalized fluorescence change 


0.01 
concentration (uM) 


0.10 1.0 10 


labelled with Oregon Green 488 on H2B(Ser112Cys) mutant (H2B $112- 
OG488). Top right, binding of PRC1 ubiquitylation module leads to partial 
quenching of the fluorophore allowing affinity measurements to be made. 
Bottom, three technical replicates of fused wild-type PRC1 ubiquitylation 
module are shown to demonstrate reproducibility. Mean and s.d. shown for 
each data point, n = 3. 


©2014 Macmillan Publishers Limited. All rights reserved 


Nucleosome ubiquitylation 


—-H2Aub1 
H3 UbcH5c 
TH AH 2B/ 
-H4/ub/Bmi1/ 
Ring1B 
23 46 6 ¥ 4 BS 4 BS 6 FF 
Coomassie Fluorescence 
e 
x 
Ss 
-H2Aub3 
-H2Aub2 
-H2Aub1 
+H3 UbcH5c 
*+H2A/H2B/ 
—-H4/ub/Bmi1/ 
Ring1B 
i283 42465 6 Ff 4 2S 42 8 GF 0 
Coomassie Fluorescence 


9g 


9 & 
se Oo WV 
urctsc & FE ES 
ATP — + + + 


21- 
soe 
14-| a ee 
{ 2@Q@Ss 4h 6 1 2S 4 5 6 
Coomassie Fluorescence 
x 
: £ 
j = k 
A&A AW AW 
ubchc $< $L L$s 
++ 
-H2Aub3 
-H2Aub2 
———|-H2Aub1 


H3 UbcH5c 
|*H2A/H2B/ 
—H4/ub/Bmi1/ 
Ring1B 
123 4567891012345 67 8910 


Coomassie Fluorescence 


Extended Data Figure 7 | Effects of PRC1 ubiquitylation module mutants 
on nucleosome ubiquitylation and binding. a, Representative gel of one 
replicate of ubiquitylation assay using E1 Ubal, UbcH5c, STR-Hisj9 tagged 
ubiquitin, NCPs, and E3 Ring1B-Bmil with the indicated Ring1B mutants 
stained with Coomassie (left) and scanned for fluorescent H2A (right). NCPs in 
the experiment are doped with NCPs containing the H2A(Thr10Cys) mutant 
labelled with Oregon Green 488 maleimide. b, Quantitation of mono- 
ubiquitylated (H2Aub1, dark blue), di-ubiquitylated (H2Aub2, blue) and 
tri-ubiquitylated (H2Aub3, light blue) H2A are shown as a fraction of total 
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H2A. Data are mean and s.d. from three technical replicates. Samples from the 
same experiment were analysed on different gels, processed in parallel. 

c, Fluorescence quenching binding curves for fused PRC1 ubiquitylation 
modules containing the indicated mutations of Ring1B (coloured as shown). 
Mean and s.d. are shown for each data point, n = 3. Fluorescence is normalized 
to fit values for unbound and saturated NCPs. Concentrations depicted 
using log scale. d-1, Experiments as described above for indicated Bmil 
mutants (d-f), UbcH5c charge reversal mutants (g-i) and UbcH5c alanine 
mutants (j-l). Triplicate ubiquitylation assays were repeated at least twice. 
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Extended Data Figure 8 | Alignment of Bmil paralogues and E2s. 


a, Sequence alignment of segments of Bmil with PCGF (Polycomb group 
RING finger) paralogues also found in PRC1. b, Sequence alignment of 


UbcH5c and other E2s known to be active (green) or inactive (red) or of 


unknown compatibility with Ring1B-Bmil-mediated nucleosomal 


ubiquitylation. Key residues discussed in the text are indicated with an asterisk. 
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Extended Data Figure 9 | Linker DNA increases the affinity of the PRC1 
ubiquitylation modules for the nucleosome. a, Linear B-form duplex DNA 
modelled onto the DNA ends of the PRC1 ubiquitylation module-NCP 
structure. b, The UbcH5c «4 helix occupies the major groove of modelled linker 
DNA. Several basic and aromatic side chains line the DNA-adjacent face of the 
a4 helix. c, Linker DNA enhances nucleosomal binding of fused PRC1 
ubiquitylation module. Fluorescence quenching binding curves for the 
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Kd (uM) 
e 147 bp DNA repl. 1 0.23 + 0.02 
© 147 bp DNA repl. 2 0.234 + 0.008 
¢ 185 bp DNA repl. 1 0.126 + 0.004 
¢ 185 bp DNA repl. 2 0.127 + 0.004 


Normalized fluorescence change 


0.01 


concentration (uM) 


0.10 1.0 10 


wild-type fused PRC1 ubiquitylation module binding to nucleosomes centred 
on 147-bp (black, grey and light grey) or 185-bp (blue and light blue) 601 DNA. 
Technical replicates performed on different days are depicted in different 
shades of grey and blue. Mean and s.d. are shown for each data point, n = 3. 
Fluorescence is normalized to fit values for unbound and saturated NCPs. 
Concentrations depicted using log scale. 
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Extended Data Figure 10 | BRCA1 requires similar loop region and 
H2A-H2B acidic patch for nucleosome ubiquitylation. a, Alignment of the 
BRCA1-BARD1 and Ring1B-Bmil heterodimers using the RING domains of 
BRCA1 from the BRCA1-BARD1 NMR structure** (PDB code 1JM7) and 
Ring1B. BRCA1 and BARD] are shown in orange and purple, respectively. 
Ca atoms of essential arginine residues are indicated by spheres. b, Sequence 
alignment of Ring1B and BRCA1 shows conserved nucleosome interacting 
loop. c, BRCA1(Lys70Ala/Arg71Ala) mutation eliminates E3 ligase activity of 
BRCA1-BARD1 RING heterodimer. Representative gel of one replicate of 


BRCA1(61-74) 
Ring1B (87-101) 


nucleosome 
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ubiquitylation assay using El Ubal, E2 UbcH5c, STR-His)9 ubiquitin, NCPs, 
and E3 BRCA1-BARD1 with the indicated mutants stained with Coomassie 
and scanned for fluorescent H2A. NCPs in the experiment are doped with 
NCPs containing the H2A(Thr10Cys) mutant labelled with Oregon Green 488 
maleimide. d, Quantification of mono-ubiquitylated (H2Aub1, dark orange), 
di-ubiquitylated (H2Aub2, orange) and tri-ubiquitylated (H2Aub3, light 
orange) H2A are shown as a fraction of total H2A. Mean and s.d. from three 
technical replicates are depicted. Ubiquitylation assay was repeated twice. 
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Spatially resolved magnetic field structure in the disk 


of a T Tauri star 


Jan W. Stephens’, Leslie W. Looney’, Woojin Kwon’, Manuel Fernandez-Lopez”*, A. Meredith Hughes’, Lee G. Mundy?®, 


Richard M. Crutcher’, Zhi-Yun Li’ & Ramprasad Rao® 


Magnetic fields in accretion disks play a dominant part during the 
star formation process'” but have hitherto been observationally poorly 
constrained. Field strengths have been inferred on T Tauri stars* 
and possibly in the innermost part of their accretion disks*, but the 
strength and morphology of the field in the bulk of a disk have not 
been observed. Spatially unresolved measurements of polarized emis- 
sion (arising from elongated dust grains aligned perpendicularly to 
the field’) imply average fields aligned with the disks®’. Theoretically, 
the fields are expected to be largely toroidal, poloidal or a mixture of 
the two'**"°, which imply different mechanisms for transporting an- 
gular momentum in the disks of actively accreting young stars such 
as HL Tau (ref. 11). Here we report resolved measurements of the 
polarized 1.25-millimetre continuum emission from the disk of HL 
Tau. The magnetic field on a scale of 80 astronomical units is coin- 
cident with the major axis (about 210 astronomical units long’”) of 
the disk. From this we conclude that the magnetic field inside the disk 
at this scale cannot be dominated by a vertical component, though a 
purely toroidal field also does not fit the data well. The unexpected 
morphology suggests that the role of the magnetic field in the accre- 
tion of a T Tauri star is more complex than our current theoretical 
understanding. 

HL Tau is located 140 pc away”, in the Taurus molecular cloud. Al- 
though HL Tau is a T Tauri star, it is considered to be in an early stage 
of development owing to its bipolar outflow“ and possible residual en- 
velope’*. Observations and modelling of this protostar assuming a thick, 
flared disk suggest a stellar mass of ~0.55 times the solar mass (Mo) 
and a disk mass of 0.14M qq (ref. 12). A possible planet forming in the 
disk of HL Tau has been observed'®, though this detection was not 
confirmed!”. However, the disk of HL Tau is gravitationally unstable, 
which could favour fragmentation into Jupiter-mass planets’*'®. HL Tau 
has the brightest disk of any T Tauri star at millimetre wavelengths, al- 
lowing observations of the fractional polarization (P) to have the best 
possible sensitivity. Previous observations of the polarization of the disk 
of HL Tau yielded marginally significant, spatially unresolved polariza- 
tion detections with the James Clerk Maxwell Telescope (JCMT; polariza- 
tion P = 3.6 + 2.4% at 14” = 1,960 astronomical units (AU) resolution)® 
and the Submillimetre Array (SMA; polarization P = 0.86 + 0.4% at 
2” = 280 AU resolution, archival observations released in this Letter). In 
addition, observations of HL Tau with the Combined Array for Millimeter- 
wave Astronomy (CARMA) have shown that the interferometric emis- 
sion comes entirely from the disk with no contamination from large-scale 
envelope emission”. HL Tau is therefore a very promising source to 
search for a spatially resolved polarization detection. 

Only through observations of polarized dust emission can the mor- 
phology of the magnetic field be ascertained; however, higher-resolution 
dust polarimetric observations of T Tauri star disks do not detect polar- 
ization and place stringent upper limits (P< 1%) on the polarization 
fraction’”’* that disagree with theoretical models ofhigh efficiency grain 


alignment with a purely toroidal field*. There is a clear discrepancy 
between theoretical models of the magnetic fields in disks and the ob- 
servations to date, requiring more sensitive observations of the dust 
polarization. The SMA recently detected the magnetic field morpho- 
logy in the circumstellar disk of the Class 0 (that is, the earliest proto- 
stellar stage) protostar IRAS 16293-2422 B”, but since this disk is nearly 
face-on, observations cannot detect the vertical component of the mag- 
netic field. Moreover, this source is perhaps one of the youngest of the 
known Class 0 sources”’, increasing the chances that the polarized flux 
could be from the natal environment. Nevertheless, since the disk is the 
brightest component at the probed scale, polarization most probably 
comes from the disk, and the magnetic field morphology hints at toroi- 
dal wrapping”. 

We obtained 1.25-mm CARMA polarimetric maps of HL Tau at 0.6” 
(84 aU) resolution and plot the magnetic field morphology in Fig. 1. 
This is a spatially resolved detection (with approximately three inde- 
pendent beams) of the magnetic field morphology in the circumstellar 
disk of a T Tauri star. The central magnetic field vector has a measured 
position angle (PA, measured anticlockwise from north) of 03 = 143.6 
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Figure 1 | Detected magnetic field morphology of HL Tau at 0.6” resolution. 
The polarization vectors (short coloured lines) have been rotated by 90° to show 
the inferred magnetic field orientation. Red vectors are detections >30p 
while blue vectors are detections between 2¢p and 3p, where op is the 

r.m.s. noise of P. We do not show vectors when the signal-to-noise ratio for 
Stokes I is below 2. The sizes of the vector are proportional to the fractional 
polarization, P, with the red scale bar corresponding to P = 1%. Stokes I 
contours are shown for [—3, 3, 4, 6, 10, 20, 40, 60, 80, 100] X o;, where 

0, = 2.1 mJy per beam and is the r.m.s. noise of I. The dashed line shows the 
major axis of PA = 136° (ref. 12). The synthesized beam is shown at bottom 
right and has a size of 0.65” X 0.56” and PA = 79.5°. Dec., declination; RA, 
right ascension. 
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+ 4.4°, which is within 9° of the previously measured PA of the major 
axis of the HL Tau disk’*. This angle is in agreement with that measured 
by the JCMT (03 = 140 + 20°)° and archival observations from the SMA 
analysed here (03 = 137 + 13°). The central vector has a fractional po- 
larization of P = 0.59 + 0.09%, and P varies over the disk between 0.54 + 
0.13% and 2.4 + 0.7% with an average and median of 0.90% and 0.72% 
respectively, in agreement with the upper limits (P< 1%) of other T 
Tauri star polarimetric observations'”’*. This median value is signifi- 
cantly less than in IRAS 16293-2422 B (1.4%), which could indicate 
disk evolution; the dust grains could be growing larger and becoming 
more spherical with time, and/or the magnetic field is becoming more 
turbulent. 

To constrain the intrinsic magnetic field configuration inside the disk 
of HL Tau, we made a simple model which incorporates a combination 
of a toroidal and a vertical component; any radial field component in- 
side the bulk of the disk (most probably probed by our polarization 
observation) is expected to be sheared quickly into a toroidal config- 
uration by differential rotation on the short timescale of the disk rota- 
tion. We use the best estimates of disk parameters'* (PA = 136° and 
inclination i = 40°, both accurate to within a few degrees) and vary the 
relative strength of the two field components from 100% toroidal and 
0% vertical to 0% toroidal and 100% vertical in 1% increments (see 
Methods for modelling details). Using the =3o detections in Fig. 1, we 
find that a completely toroidal field is the best-fit model. However, the 
reduced ¢ value is high (69, where a value of 1 indicates an optimal fit). 
If we do not constrain the magnetic field to be aligned with the best-fit 
disk” we can achieve a better fit. For a disk with a PA of 151°, the best- 
fit model to the observations is almost completely edge on, with an in- 
clination of i = 89° (reduced y value of 1). Figure 2 shows the observed 
and the modelled parameters for a PA of 136° and i = 40° as well as a 
PA of 151° andi = 89°. Although a PA of 151° is not very different from 
the major axis of the disk (15° difference), i = 89° is inconsistent with 
our disk dimensions and the well-constrained continuum observations 
of other studies'**". There is also no detected polarized flux in the north- 
east (upper-left) and southwest (bottom-right) of the disk where P for 
a toroidal field should be highest due to less beam smearing. This fact, 
along with the poor fit with the constrained disk parameters, indicates 
that although there is probably a toroidal component, the field appar- 
ently has substantial contributions from other components. 

We note that adding a vertical component to all our models makes 
the fit worse. This lack of any vertical component in our model suggests 
that dominant poloidal fields are probably absent inside the disk because, 
as we argued earlier on dynamical grounds, a predominantly radial field 
is unlikely inside the disk because of rapid differential rotation. More- 
over, although points with >5c polarization detections fit a toroidal mor- 
phology even for the i = 40° model, we cannot be sure that the disk field 
is only toroidal; this is because a straight, uniform field in the plane of the 
disk, although physically unlikely for a disk system, also fits the data well. 
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Figure 2 | Observed magnetic field 
morphology compared with 
models. a, 100% toroidal field model 
with PA = 136° and i = 40°. b, As 

a but with PA = 151° and i= 89°. 
The observed 30 and 5a detections 
are shown in red and orange, 
respectively, while modelled vectors 
are shown in yellow. Black contours, 
colour scale and beam size are the 
same as Fig. 1. 
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If the disk of HL Tau has a dominant toroidal component, then it is 
uncertain what causes the outer vectors to not fit the toroidal morpho- 
logy based on the best-fit disk dust model’? (Fig. 2a). Perhaps the grains 
in some parts of the disk do not efficiently align with the disk field. An- 
other possibility is that the magnetic field towards the inner disk is toroi- 
dal and beam-averaged to be aligned with the major axis of the disk. 
However, towards the edge of the disk, where the disk field may be less 
tightly wound and weaker, the magnetic field may be dominated by ex- 
ternal field lines that are already toroidal (for example, due to a rotating 
envelope). The incoming fields may strongly influence or even domi- 
nate the magnetic field in the outer disk. In any case, the discrepancy 
indicates that, at least for this particular source, one needs to go beyond 
the simplest magnetorotational instability” disk models that do not in- 
clude any external influence or distortion. Both theoretical studies tai- 
lored to HL Tau and higher-resolution polarization observations are 
needed to resolve this puzzle. 

At the 1,000 au scale, structured magnetic fields are observed around 
young, low-mass protostars**”’, but the fields appear to be randomly 
aligned with respect to the inferred disk****. Misalignment of the field 
lines with the rotation axis can help overcome magnetic braking to create 
acentrifugally supported disk at the 100 au scale’’. Further disk evolu- 
tion can be driven by magnetorotational instability” or a magnetocen- 
trifugal wind’”; a toroidally dominant disk field is expected in the former 
scenario, and a significant poloidal field is required for the latter (at least 
near the wind launching surface). Both processes can possibly contrib- 
ute to the disk accretion at the same time. 

Until now, we have been unable to observationally constrain the mag- 
netic field morphology in disks. Along with the Class 0 source IRAS 
16293-2422 B, our observations of HL Tau show that a toroidal field 
component may last from the low-mass protostar’s initial formation 
to the T Tauri star stage—approximately the first 10° years of a proto- 
star’s life’°. The apparent absence of vertical fields in these observations 
implies that magnetocentrifugal winds driven along large-scale poloi- 
dal magnetic fields” are probably not the dominant mechanism for re- 
distributing the disk’s angular momentum during the accretion process 
ofa star at the 80 aU size-scale of our observations. However, the mor- 
phology detected in HL Tau also cannot be fully explained by a simple 
mix of toroidal and vertical components, requiring future observations 
at both large scale and small scale to truly understand the role of mag- 
netic fields in the formation of solar systems like our own. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Data reduction. The CARMA dual polarization receivers allow for the measure- 
ment of polarized dust emission. Polarimetric maps can thus be created, which pro- 
vide the flux density (Stokes J), the PA of the dust polarization, and P at every point 
within the map. CARMA continuum observations in Full Stokes mode were taken 
at 237 GHz in two different resolutions with each resolution having multiple tracks. 
The C-array observations (~0.79" resolution) consisted of four tracks in October 
and November 2013 while the B-array observations (~0.37” resolution) consisted 
of three more tracks in November and December 2013. 

We have reduced the CARMA observations using the MIRIAD package”*. The 
dual-polarization receivers of CARMA measure right- (R) and left- (L) circular polar- 
ization and the four cross-polarization (RR, LL, LR, RL) terms. In order to calibrate 
CARMA Full Stokes observations, apart from the usual interferometric calibrations 
(bandpass, phase and flux corrections), two additional calibrations are required: 
XYPhase (due to L and R channel phase differences) and leakage (due to L and R 
channels cross-coupling). These calibrations were done in the typical manner for 
CARMA”. The leakage terms for each antenna were consistent from track to track, 
and the overall accuracy of the leakage calibrations are expected to be approximately 
0.1%. Day to day consistency in polarization observations was tested by measuring 
the phase calibrator, 0510+ 180. The polarization angle changed slightly from day 
to day, with a steady increase of a few degrees with each newer track. Intraday var- 
iability is a well-known phenomenon which affects the total flux density, the line- 
arly polarized flux density, and the polarization angle” and probably explains the 
variations of a few degrees seen in 0510+ 180 from track to track. Since the variation 
of the polarization in 0510+ 180 was not very large, the consistency of polarization 
measurements of 0510+ 180 between tracks made us confident that our calibration 
is accurate. For B-array tracks, we also saw that the bandpass calibrator, 3C454.3, 
showed consistency for polarization measurements for all the tracks. Other cali- 
brators observed did not have polarization detected, signifying that our polariza- 
tion detection of HL Tau is not a spurious detection. We also note that there was a 
slight difference in the polarization angle between the lower and upper sideband; this 
difference may be due to Faraday rotation and was almost constant for all tracks. 

For bandpass calibration, C-array observations used 3C84 and B-array observa- 
tions used 3C454.3. The phase calibrator used for both arrays was 0510+ 180. Obser- 
vations of MWC349, with an adopted flux of 2.1 Jy (ref. 30), provided the absolute 
scale for the flux calibration at 237 GHz in most of the tracks. The bootstrapped 
flux of 0510+ 180 using MWC349 was consistent within 10% and 15% with boot- 
strapped fluxes using Mars and Uranus, respectively, in other tracks. Since planets 
are resolved at these resolutions, MWC349 is likely to have a more accurate flux cal- 
ibration and was bootstrapped for all tracks. The absolute flux uncertainty is esti- 
mated to be 15%, but only statistical uncertainties are discussed in this work. When 
imaging, natural weighting was used to maximize the sensitivity. 

Detections of polarization were found in every HL Tau track with consistent polar- 
ization angles and measurements. Since polarization is calculated from Stokes Qand 
U and can only have positive values, there exists a bias in the polarized intensity; 
hence, all our polarization measurements have been de-biased”’. The sensitivity in 
Stokes Tis limited by dynamic range rather than the flux sensitivity of the observa- 
tions. The uncertainty in the absolute position angle of CARMA is approximately 
3° (ref. 25). 

These high resolution interferometric observations are insensitive to large scale 
structure. Single dish and interferometric HL Tau observations find very similar 
compact fluxes, suggesting that the envelope dust continuum is negligible”’, and the 
flux appears to come entirely from the disk'?. 

Also reported in this study are unpublished polarization observations (PA and P) 
of HL Tau from the SMA. These observations were taken in the compact configuration 


in October 2005 in the 345 GHz atmospheric window with the Local Oscillator 
tuned to 341.5 GHz. The polarization data reduction process was done in the typ- 
ical manner employed by the SMA”. 

Linear polarization modelling. We employed a flared viscous accretion disk model 
that was constrained by high angular resolution data and a broad spectral energy 
distribution, detailed in another paper’”. The accretion disk model has a power-law 
density distribution with an exponential tapering, and the vertical density distri- 
bution is assumed to be 1.5 times thicker than the hydrostatic equilibrium case. The 
temperature distributions are calculated by interpolation of two power-law distri- 
butions at the cold mid-plane, comparable to the results of the Monte-Carlo radi- 
ative code RADMC-3D” and at the surface based on radiation equilibrium: T,,[K] = 
190(r/Au) °* and T,[K] = 600(r/Au) °*8, where Ti, and T, are the temperatures 
of the disk at the mid-plane and surface, respectively. The disk parameters employed 
were the volume density power-law index, taken as 1.064, the dust opacity spectral 
index B = 0.729, the disk mass Mg = 0.1349M 6, the inner radius R;,, = 2.4 AU, and 
the characteristic radius R. = 78.9 Au. As described in the main text, we investigated 
various cases of different inclination and position angles. 

Our polarization modelling consists of toroidal and vertical magnetic fields. In- 
stead of constraining the detailed morphology of magnetic fields, we intended to 
constrain which morphology is preferred. In order to achieve this goal, we exam- 
ined 101 cases spanning relative polarization fractions of the two orthogonal fields 
in steps of 1% (that is, 100% toroidal, 99% toroidal and 1% vertical, 98% toroidal 
and 2% vertical, ...). For constructing linear polarization information, we built Stokes 
I, Qand U maps by numerically solving radiative transfer (necessary for a thick 
disk). In individual integral elements of radiative transfer along the line of sight, 
we compute the intensity for Q and U. The fractional intensities added up to the Q 
and U maps by an integral element are: 


AQ=AI fp (for tor +fuerQver) (1) 


AU= AI fo(ftorMtor + frertlver) (2) 


where f, is a total polarization fraction (v Q+U? / I ) » ftor and fre, are relative 
fractions of the toroidal and vertical fields (for example, fior = 0.7 and frer = 0.3 for 
70% toroidal and 30% vertical fields), and gio, and Uo, are COS(2 Zor) and sin(2Yjor) 
respectively. 7,0, is the angle, y, of the toroidal magnetic field measured anticlock- 
wise from the north. Similarly, qye, and Uye, for vertical fields are cos(27%ye,) and 
sin(27ver), Where Yyer is the angle of the vertical magnetic field. Since the disk is 
optically thin and we are only concerned with the morphology, f, can be given an 
arbitrary value (for example, 0.01 or 0.1). Note that the toroidal and vertical mag- 
netic field vectors at each integral element have been tilted and rotated based on 
the inclination and the position angle of the model disk, before the calculation of 
the fractions. Qand U maps are convolved with the synthesized beam from the po- 
larization observations, and the modelled position angles of the magnetic field mor- 
phology are created using y = 0.5 tan” !(U/Q). 
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Possible planet formation in the young, low-mass, 
multiple stellar system GG Tau A 


Anne Dutrey'?, Emmanuel Di Folco!?, Stephane Guilloteau?, Yann Boehler’, Jeff Bary’, Tracy Beck”, Hervé Beust®, 
Edwige Chapillon’”, Fredéric Gueth’, Jean-Marc Hure!”, Arnaud Pierens!?, Vincent Piétu’, Michal Simon® & Ya-Wen Tang? 


The formation of planets around binary stars may be more difficult 
than around single stars’’. In a close binary star (with a separation 
of less than a hundred astronomical units), theory predicts the pres- 
ence of circumstellar disks around each star, and an outer circum- 
binary disk surrounding a gravitationally cleared inner cavity around 
the stars**. Given that the inner disks are depleted by accretion onto 
the stars on timescales of a few thousand years, any replenishing mate- 
rial must be transferred from the outer reservoir to fuel planet for- 
mation (which occurs on timescales of about one million years). Gas 
flowing through disk cavities has been detected in single star systems°. 
A circumbinary disk was discovered around the young low-mass bi- 
nary system GG Tau A (ref. 7), which has recently been shown to bea 
hierarchical triple system®. It has one large inner disk’ around the 
single star, GG Tau Aa, and shows small amounts of shocked hydro- 
gen gas residing within the central cavity’’, but other than a single 
weak detection”, the distribution of cold gas in this cavity or in any 
other binary or multiple star system has not hitherto been determined. 
Here we report imaging of gas fragments emitting radiation char- 
acteristic of carbon monoxide within the GG Tau A cavity. From the 
kinematics we conclude that the flow appears capable of sustaining 
the inner disk (around GG Tau Aa) beyond the accretion lifetime, 
leaving time for planet formation to occur there. These results show 
the complexity of planet formation around multiple stars and con- 
firm the general picture predicted by numerical simulations. 

The triple stellar system GG Tau A consists of a single star GG Tau 
Aa and a close binary GG Tau Ab (with individual stars named Ab1 
and Ab2). The system is 1-5 million years old’*”’, and is located 140 pc 
from Earth in a hole of the Taurus molecular cloud. Its molecular emis- 
sion is free of contamination” and there is no known outflow or jet assoc- 
iated with the source. GG Tau Aa and Ab have an apparent separation 
of 35 astronomical units (AU) while the separation of GG Tau Ab1 and 
Ab2 is 4.5 Au (ref. 8). The outer Keplerian disk of gas and dust surround- 
ing GG Tau A (called the circumbinary disk for simplicity) consists of 
a ring extending from radius r~ 190 au to r~ 280 AU and an outer 
disk extending up to 800 Au from the central stars with a total mass of 
~0.15Mo (ref. 14; here Mo is the solar mass). 

Using the Atacama Large Millimetre Array (ALMA), we observed 
GG Tau A in the dust thermal emission at 0.45 mm wavelength and in 
the CO J = 6-5 line (Fig. la-c) with an angular resolution of 0 ~ 0.25” 
or ~35 AU. The continuum image shows cold dust emission from only 
one circumstellar disk-like structure associated with GG Tau Aa’”*. We 
estimate the minimum dust disk size to be ~7 au while the minimum 
mass of gas and dust is roughly 10° * Mo, about Jupiter’s mass. The com- 
plex CO J = 6-5 spectral line shape at the location of GG Tau Aa also 
reveals the existence of a CO circumstellar disk of outer radius ~20 au 
(Methods and Extended Data Fig. 1). We do not detect cold dust emis- 
sion around GG Tau Ab, even though the existence of an inner dust 
disk (or disks) has been inferred from unresolved infrared emission’®. 


Our 0.45-mm flux upper limits (Methods) are compatible with tidal 
truncation, which would prevent any circumstellar disk extending beyond 
about 2 Au (ref. 8). The ALMA CO J = 6-5 image (Fig. la-c and Extended 
Data Figs 1 and 2) also clearly resolves CO gas within the central cavity 
with a structure indicative of the streamer-like features which have been 
hinted at by hydrodynamic simulations in binary systems*'”. The CO 
gas appears to be inhomogeneous, existing as a series of fragments, and 
the structure is dominated by an east-west extension. No northern fea- 
ture is seen, contrary to the very low level (a signal-to-noise ratio of 2) 
extension seen in continuum at 270 GHz (ref. 15). 

The Institut de RadioAstronomie Millimetrique (IRAM) image 
(Fig. 1d-fand Extended Data Fig. 2) reveals, at lower angular resolution, 
that the CO J = 2-1 emission peaks are located near the inner edge of 
the outer ring (at radius ~ 100-150 av). In contrast, the CO J = 6-5 
emission peaks near GG Tau Aa and GG Tau Ab, close to the bright 
regions of near-infrared v = 1-0 S(1) H, emission (Fig. 1c) that have 
been interpreted as shock-excited gas at the interface between the stream- 
er and gas associated with the inner disks’®. A study of the excitation 
conditions (Methods) reveals that CO J = 2-1 and J = 6-5 emissions 
arise in different physical conditions: the former correspond to extended, 
cold (~35 K) optically thick areas, while the latter trace optically thin, 
warmer gas (~70 K) particularly at the interface between the streamer 
and the inner disk of Aa. The mass of each CO J = 6-5 fragment is 
about 5 X 10 °Mo (Methods). With a minimum accretion rate of 
10 *Moyr ',a fragment reaching the Aa disk may disappear in at 
most 5,000 years (a few tens of times the orbital period of the binary Aa- 
Ab). The Aa disk mass currently represents about 20 such fragments; 
in 1 Myr, at least 200 fragments of similar mass must have been accreted 
to sustain such a disk. The corresponding minimum mass, which accreted 
from the circumbinary disk, represents about 10% of the current outer 
disk mass (~0.15 Mao). The morphology of the gas in the CO J = 6-5 
and J = 2-1 maps reveals departures from symmetry, unlike hydrody- 
namical simulations which predict symmetric streamers for an equal- 
mass, low-eccentricity binary system’. GG Tau Aa and Ab each have a 
mass of about 0.65 Mo (ref. 13) and their orbital eccentricity is con- 
strained to e = 0.35 (ref. 18). The origin of this asymmetry might either 
be found in this eccentricity’ or in the triple nature of GG Tau A, as 
the binarity of Ab breaks the symmetry. 

The change in velocity of the CO J = 6-5 emission along the major 
axis of the dust ring is similar to that of CO J = 2-1 (Fig. 1b, e and 
Extended Data Figs 1 and 2) and the velocity gradient is that ofa rotating 
disk’**". At radius 200 au, we find that the velocity of the CO J = 6-5 
gas agrees with the known Keplerian speed (Methods and Extended 
Data Table 1) derived from existing '*CO maps'*”! and corresponds to 
the canonical dynamical mass of the triple star GG Tau A (1.28 Mo)"*”". 
This is still true down to a radius of ~70-80 Av. Closer to the stars, the 
velocity pattern of the CO J = 6-5 emission becomes dominated by the 
individual gravitational fields of GG Tau Aa and GG Tau Ab. Limited 
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Figure 1 | ALMA and IRAM images of GG Tau A. a-c, ALMA; d-f, IRAM. 
a, 0.45-mm emission (black contours) and CO 6-5 flux (colour: see colour 
scale at top). b, 0.45-mm emission (black contours) and CO velocity field 
(colour). c, 0.45-mm emission (colour) with CO 6-5 flux (blue contours) and in 
inset H, intensity (red). d, e, as a, b, but for 1.3-mm emission (contours) and 
CO 2-1 flux (colour). f, 1.3-mm emission (colour) and H) intensity (contours). 


angular resolution (35 Au) precludes detailed kinematical analysis, but 
study of the northern CO J = 6-5 peak vicinity reveals that its velocity 
gradient is dominated by rotation, with departure from rotation com- 
patible with infall motion (Methods and Extended Data Fig. 3). 
Another result concerns the dust and gas circumbinary reservoir. The 
dust emission from the circumbinary ring is very well resolved (Fig. 1 
and Extended Data Fig. 4) but appears uniform within the noise. The 
combination of the 0.45-mm ALMA image with existing 1.3- and 3.4-mm 
continuum maps’*”' allows for a direct measurement of the radial de- 
pendence of the temperature in the circumbinary dust disk. Using a 
radiative transfer model (Methods), we determined the dust temper- 
ature profile Tp(r) = 13.8 X (7/200 AU) ~ 1K which gives a temperature 
of ~8.5 K at 300 Au. Thanks to the 0.45-mm image, the derived dust 
temperature is almost independent of the dust properties. This repre- 
sents a significant improvement on the knowledge of the mid-plane 
physical conditions (Methods, Extended Data Table 2 and Extended 
Data Fig. 4). Since our measurement is the result of vertical averaging 
of the temperature, and the upper layers of the disk are hotter owing to 
direct irradiation by the stellar ultraviolet flux, the mid-plane must be 
even colder. The limited grain growth (Methods) helps explain the low 
temperature, since small grains are most efficient at attenuating ultra- 
violet radiation coming from the central stars. Furthermore, at the ring 
inner edge, a puffed-up rim, caused by direct stellar heating, should cast 
a shadow on the outer region of the disk. Such a shielding effect could 
explain the steep slope of the temperature profile. Previous '*CO mea- 
surements and analysis yielded a similarly steep slope (exponent ~0.9) 
for the gas temperature profile in the circumbinary disk but a warmer 


Positions are relative to right ascension (RA) 04h 32 min 30.359 s and 
declination (dec.) 17° 31’ 40.38” (J2000). Crosses are the locations of Aa (south) 
and Ab (north) components, and triangles and squares show the locations 

of the CO J = 6-5 and J = 2-1 peaks, respectively. Units on the colour scales are 
Jy per beam km s — '(a,d), kms ‘ (b,e) and mJy per beam (c, f). Ina and d, the 
beam size is given in the inset. 


gas temperature (~ 20K, at 300 av)”’. This is consistent with CO mole- 
cules being trapped onto grains in the mid-plane, whose temperature is 
below the CO freeze-out point (~17 K), and with CO gas only present 
in the heated upper layers of the disk”. 

Figure 1 shows that, unlike the continuum emission of dust, little CO 
J = 6-5 line emission is detected near the circumbinary disk inner radius 
at ~200-250 au. This could be attributable to both an H, density insuf- 
ficient to thermalize the upper energy level of the CO transition and 
lower sensitivity of the interferometer to relatively large angular scale 
(>1”) structure (Methods). Figure 1 also shows strong excess of CO 
J = 6-5 and 2-1 emission at position angle ~120° (measured east of 
north) and radius ~240-270 au. From these CO data alone we cannot 
discriminate between a temperature and/or density increase (Methods). 
We suggest that this localized emission is attributable to a progenitor of 
asub-stellar body reminiscent of the planetary-mass companion imaged 
around the low-mass binary 2MASS 0103AB”*. Such a companion would 
provide a natural explanation for the confinement of ~80% of the cir- 
cumbinary mass within the ~90 au breadth of the ring’. Simulations 
of the evolution of binary systems in which Kepler detected exoplanets 
demonstrate that Saturn-mass proto-planets can remain at the ring outer 
edge in some cases”. We do not, however, detect a gap near the outer 
edge of the gas and dust circumbinary ring but existing '*CO, '*CO and 
thermal dust images are not incompatible with its existence (Methods). 

Our observations demonstrate that active replenishment from the 
outer disk can sustain the circumstellar disk surrounding GG Tau Aa 
for a time exceeding the accretion lifetime, increasing its potential for 
planet formation. The presence of a condensation at the outer edge of 
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the ring is puzzling, and needs further investigation to determine its 
links with accretion processes and possible planet formation. Since almost 
half of Sun-like stars were born in multiple systems”, our observations 
provide a step towards understanding the true complexity of proto- 
planetary disks in multiple stellar systems and unveiling planet formation 
mechanisms for a significant fraction of stellar systems in our Galaxy. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

ALMA observations. GG Tau was observed with ALMA in Cycle 0 (project 2011. 
0.00059.S) in Band 9 on August 13 2012. The spectral set-up was of 4 spectral win- 
dows of total width 0.938 GHz, centred at 691.485, 689, 676 and 674 GHz respect- 
ively. The correlator observed in dual polarization, yielding a channel spacing of 
244 kHz, or 0.11 kms~?. With Hanning smoothing, the spectral resolution is twice 
those values. The array of 23 antennas was in its extended configuration. The water 
vapour was about 0.25 mm and system temperatures ranged from 500 to 1,000 K. 
Comparison of the CO data with existing CO J=2-1IRAM data (Fig. 1) revealed 
a frequency shift of the ALMA observations whose origin could not be identified. 
We corrected the velocity offset by correlating with the IRAM data. The velocity 
difference was 0.97 + 0.01kms |. Since it is an offset, it does not introduce any 
error on the velocity gradient. The absolute flux scale was determined using Ceres. 
The ALMA data were calibrated using CASA then exported through UVFITS to be 
imaged and analysed using GILDAS. Extended Data Fig. 1 presents the resulting 
channel maps. 

The integrated ring+disk continuum flux is in good agreement with the pre- 
vious unresolved detection”: no more than 20% of the flux is lost. 

For the CO J = 6-5 line, we only recover a fraction of the integrated line flux of 
~38+ 8Jykm s | (ref. 26). The ALMA antenna configuration exhibits a lack of 
short baselines which induces a partial loss of flux for the extended structures that 
deconvolution cannot recover, particularly in the CO circumbinary disk which 
extends up to 800 Au. We ran several disk simulations of the CO J = 6-5 emission 
using the ALMA simulator’’. If emission only arises from the ring, our simulation 
shows that about 20% of the emission would be filtered out by the configuration of 
ALMA. A much larger fraction is lost if CO J = 6-5 extends into the whole outer 
disk (up to 800 Av); this fraction depends on the signal to noise ratio. The expected 
extension of the CO J = 6-5 emission depends on the excitation conditions in the 
outer disk, so uncertainties on the amount of flux loss preclude any quantitative 
analysis of the large scale disk emission in CO J = 6-5 data. 

IRAM data. We complement ALMA data with CO J = 2-1 and continuum images 
from the IRAM interferometer. The compact configuration was used for observa- 
tions on 5-6 December 1997, and long baselines up to 400 m were used on 28-31 
January 1998 and 18 March 1998’. The unpublished extended configuration data 
were obtained on 16 February 2008. System temperatures were 120 K. Flux calibration, 
based on MWC3439, gives an absolute flux accuracy of about 10%. Imaging was 
performed using GILDAS software. The resulting beam size is 0.65” X 0.29” at pos- 
ition angle PA = 21°. The spectral resolution is 0.10 km s— ! and the effective noise 
is about 14 mJy per beam (up to 20 mJy per beam on the channels with the most 
extended emission due to deconvolution limits). The continuum image at 1.3 mm 
was produced using line-free channels. Its effective noise is limited by dynamic 
range to about 1 mJy per beam. Extended Data Fig. 2 shows the CO J = 2-1 chan- 
nel maps. 

Proper motions. Accounting for the expected orbital motion, we adopted proper 
motions of (17, —19) mas yr’! for the centre of mass of the system”* to merge all 
data. 

Inner dust disk properties. For the disk of GG Tau Aa, the total fluxes of 55 mJy at 
0.45 mm and 10 mJy at 1.3 mm are consistent with optically thick dust emission at 
a mean temperature of ~35 K (ref. 29) and an outer radius of the order of 7 Au, a 
size in agreement with tidal truncation by GG Tau Ab. Assuming a standard dust 
opacity of 0.02 cm* g~' at 1.3 mm, the minimum disk mass is roughly 10°? Mo or 
a Jupiter’s mass worth of gas and dust. The rate at which matter is accreting from 
the disk onto the surface of GG Tau Aa is in the range (~ 10°-7°°-10°- 8) Ma yr! 
(ref. 12). Such a disk would dissipate in less than ~(3 X 10*)-10° yr without external 
replenishment. We place an upper limit on the circumstellar disk mass surround- 
ing GG Tau Ab of ~10°° Mo at 3a (a little more than one Earth mass) or an 
upper limit on its radius of ~1 au by extrapolating the dust opacity to 0.45 mm, 
0.067 cm’ g” ', and using a similar temperature as in the disk of GG Tau Aa. This is 
consistent with the separation of Abl and Ab2 of ~4.5 au (ref. 8). 

Inner cavity. CO excitation conditions. We performed escape probability calcu- 
lations using the code RADEX™ (http://www.sron.rug.nl/~vdtak/radex/radex.php), 
a radiative transfer code dedicated to calculations of molecular line excitations. At 
the location of the eastern CO J = 2-1 peak (~80 au from the stars), the CO lines 
are optically thick, H2 densities must be greater than 10° cm >, and the gas is cold 
with a temperature in the range ~30-40 K. Closer to the stars, the peaks of the 
bright CO J = 6-5 regions correspond to optically thin emission, with a warmer 
temperature of about 70 K and a H) density of ~5 X 10°cm *. These densities 
agree with the pre-shock gas densities derived from shock models for the 2.12-m 
H, emission maps. With a standard CO/H) ratio of 10 *, the mass of each optically 
thin clump is of the order of ~(3-7) X 10° Mo. 

CO velocity field. We studied the velocity variations of the CO J = 6-5 emission. 
The spectra displayed in Extended Data Fig. 3, at the locations of the CO J = 6-5 
clumps, also show that the velocity is dominated by rotation. The complexity of the 
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system (which is triple) and the quality of the data preclude any detailed modelling 
of the velocity field, but some points can be addressed. From a compilation of existing 
values in the literature, we derive for GG Tau Aaa velocity of 5.8 + 0.2km s | (ref. 31). 
This implies a difference of the order of 0.6 + 0.2kms_' from the known systemic 
(barycentric) velocity of the whole stellar system’. Extended Data Fig. 3 reveals 
that the measured CO gas velocity near GG Tau Aa is in very good agreement with 
Aa velocity as expected for bound motions. Around the disk minor axis, we note 
that the isovelocity contours are twisted at a radius of about 70-80 au from the 
mass centre. Inside this radius, the isocontours are closely packed, and suddenly 
spread out beyond. Furthermore, the isovelocity contours are aligned on the Aa— 
Ab line between GG Tau Aa and Ab. This is exactly what is expected when the 
gravitation field, externally dominated by the total stellar mass (1.28 Ma), becomes 
dominated by the individual mass of the star Aa and binary Ab. At the location of 
the CO J = 6-5 clumps, the spectra displayed in Extended Data Fig. 3 also exhibit a 
large velocity dispersion (5v ~ 2-2.5kms_') which could be partly due to infall. 
We check this by analysing the velocity field along the minor axis at the northern 
clump. In an inclined disk, projected velocities due to infall are at a maximum 
along the minor axis, contrary to rotation velocities which are at a maximum along 
the major axis’. Taking into account the beam size of 35 Au, we find that the 
maximum velocity dispersion along the minor axis in a spectrum would be of the 
order 1.2kms_’ if it is only due to Keplerian rotation around central objects of 
0.65 Me (because this CO J = 6-5 clump is located in the area where the velocity 
field is dominated by GG Tau Ab). This is a factor two lower than the observed line 
widths of 2-2.5kms_'. In disks, turbulence cannot explain such a large disper- 
sion**, However, infall velocities can be \/2 larger than Keplerian velocities. It is 
therefore reasonable to conclude that the observed velocities and line widths are 
compatible with a combination of rotation and infall. For the southern clump, 
located near GG Tau Aa, the velocity profile is very complex. For instance, the 
spectrum of the Aa star shows two double peaked profiles, perfectly centred on the 
velocity of Aa (5.8 km s '), but with significantly different widths (full-width at 
half-maximum, FWHM, ~1.5 and6kms/?). The total velocity dispersion of the 
broader double peaked profile (~6 km s_') is compatible with a circumstellar disk 
of outer radius of the order of ~20 au (ref. 35), if the inclination of the system is the 
same as that of the outer ring (35°). Such a CO outer radius is compatible with the 
value derived from the dust (provided a small part of the dust emission is optically 
thin) and tidal truncation due to GG Tau Ab (the physical star separation being 
about 42 Au). 

Outer disk. Dust ring properties. We use Diskfit, a radiative transfer code ded- 
icated to the modelling of the disk images in mm/submm regime”, to analyse the 
dust properties inside the outer ring using the best data available (3 mm and 1.3 mm 
PdBI data and 0.45 mm ALMA observations). A modified Levenberg-Marquardt 
scheme with step adjustment is used to derive the best fit®”. The analysis is per- 
formed in the Fourier plane to avoid non linear effects due to the deconvolution. 
We explore large ranges for all parameters to avoid local minima. Extended Data 
Fig. 4 and Extended Data Table 2 present the results, with 1o formal error bars 
derived from the covariance matrix. The errors are computed on the basis of the 
difference between data and model, so include dynamic range limitations in addi- 
tion to thermal noise. We use simple truncated power laws to model the surface 
density and temperature. We allow dust emissivity «, to vary with frequency as 
Ky = Ko X (v/v0)? (B being the emissivity index). Such a simple model is valid for 
the dust distribution since the emission is known to be confined in a ring with 
sharp edges’?'. The derived geometrical parameters are in very good agreement 
with the previous results’. Since the dust temperature is mostly constrained by the 
ratio of the flux density at 0.45 mm and 1.3 mm, we checked that the results remain 
robust even assuming an absolute flux calibration error as high as 20% at 0.45 mm. 
The density, temperature and dust spectral index are robust with respect to all mini- 
mizations. However, we find a slight dependence between f and the surface density. 
This is not surprising because in the optically thin case, the brightness temperature 
is proportional to t x «(0.45 mm) X X(r) (with «(0.45 mm) the absorption coef- 
ficient of the dust at 0.45 mm). A better removal of this dependence would require 
higher angular resolution at 1.3 mm. We found a value for the emissivity index 
6 =1.0 £ 0.1, which indicates moderate grain growth compared to interstellar 
dust. Such a grain growth is compatible with a recent analysis of the dust ring 
performed between 7 and 1.3 mm’. We found a marginal dependency of f with 
radius, suggesting that larger dust particles reside near the inner edge of the outer 
disk (that is, closer to the central stars), as observed for some circumstellar disks 
orbiting young single stars**. So far, dust temperatures have been derived either 
from Monte-Carlo dust disk models” or one-dimensional thermal models assoc- 
iated with pure mm data’. In our case, we are able to directly derive the dust temper- 
ature thanks to the deviation from the Rayleigh Jeans regime at 0.45 mm, combined 
with sufficiently high spatial resolution at 0.45 mm and 1.3 mm. This permits the 
removal of the degeneracy between the temperature and the opacity. This direct 
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measurement is of great interest for a better characterization of disk mid-planes 
where planet formation is expected to occur. 

CO excitation conditions. Analysis of the physical conditions in the outer ring at 
radius around 200 au shows that the mid-plane H, density (~10° cm~*) is high 
enough to thermalize CO transitions up to J = 6-5”’. Excitation of the CO J = 6-5 
line requires a H; critical density of ~3 X 10°cm~*. We also derive a dust tem- 
perature lower than the CO freeze-out temperature of 17 K. Therefore the observed 
large scale emissions from the CO J = 1-0 and J = 2-1 transitions must arise from 
a warmer molecular layer located above the mid-plane, as predicted by layered disk 
models” and revealed by ALMA observations of HD 163296”. 

Using our measurement of the dust temperature, we find that the hydrostatic 
scale height of the gas and dust is about ~25 and ~35 Av at radii of 200 and 300 Au, 
respectively. If the CO gas originates in a layer located at about 3 scale heights (in 
agreement with chemical models), and taking into account the vertical temper- 
ature gradient, the H, density would be ~ 10° and ~10*cm? at radii of 200 and 
300 Au, respectively. This is sufficient to thermalize the J = 1-0 and J = 2-1 lines, 
but not the J = 6-5 transition, which can be partially subthermally excited outside 
the ring. Note that any gap at the outer edge of the ring created by an embedded 
protoplanet (see section below) would imply some subthermal excitation there. 
A CO hotspot at the outer edge of the ring. The CO J = 6-5 and J = 2-1 maps 
reveal a brightness enhancement of unknown origin near the outer edge of the dust 
ring (Fig. 1 and spectrum in Extended Data Fig. 3). From the CO J = 2-1 emission 
which is optically thick and thermalized, we derive a minimum temperature of 
~40K in this clump. This indicates a temperature enhancement of at least 20 K 
compared to the surrounding optically thick CO gas. The feature is clearly domi- 
nated by the gas emission, unlike the recent detection of a dusty vortex-like struc- 
ture, or dust trap, in the circumstellar disk associated with Oph IRS 48". In the high 
angular resolution continuum maps at 0.45 mm and 1.3 mm, there is no dust coun- 
terpart to this bright CO peak. The dust absorption coefficient being small, only a 
sufficiently large and massive dust clump would be detectable. Given the effective 
(dynamic range limited) noise, the (dust) mass limit is a few times 10 — 7 Mo at 3o, or 
the size must be smaller than about 4 Au, assuming a dust temperature of 40 K. In 
Extended Data Fig. 3, there is a marginal excess emission at 1.3 mm (4c level) in the 
difference map (right panel) at the location of the blob. This has to be taken with cau- 
tion since there are other 40 peaks which are likely to trace some extended cold emis- 
sion beyond the ring outer radius”’. Analysis of the GG Tau A dust ring in term of a 
dust trap, based on resolved images between 7 and 1.3 mm, has been recently achieved 
but remains inconclusive’. The presence 10” away of the binary star GG Tau B could 
induce gravitational disturbances in the circumbinary disk especially if its orbit is 
eccentric’. However, the total mass of GG Tau B is very low (~0.17 Mo; ref. 13) 
and it is located far away from the GG Tau A circumbinary ring (~1,400 au). 
Therefore, we conclude that it is unlikely that GG Tau B would have induced such 
a sizeable disturbance in the circumbinary ring. We explore here another inter- 
esting possibility; this feature would be indirect evidence for an unknown (already 
formed) embedded companion that is still actively accreting material from the 
circumbinary disk”. In this scenario, the planet remains undetectable at millimetre 
wavelengths but the warmer accreting envelope can be seen***. The presence of such 
a companion would provide an explanation for the mass confinement in a narrow 
ring (~ 80% of the circumbinary mass is located within the ~90 au breadth ring”’). 
This situation is similar to the gas free, dusty ring orbiting the more evolved star 
Fomalhaut, where confinement by shepherding planets appears to be the most con- 
sistent explanation”. 

Simulations of the evolution of Kepler systems where exoplanets have been 
observed reveal that Saturn-like planets are expected to migrate to reach the 4:1 
resonance”’ without opening a gap, except if the disk is very cold, as is the case for 
the GG Tau ring. In such cold disks, a protoplanet can remain at the ring’s outer 
edge and the size of the gap will mainly depend on the planet mass”. So far, we 
have not detected a gap in the gas and dust circumbinary distribution of GG Tau A. 
Our CO J = 6-5 map shows that there is almost no CO J = 6-5 emission located 
outside the outer ring; this is expected if there is a gap or a strong density decrease 
which implies subthermal excitation. Gap opening depends on several physical 
parameters, such as the mass of the planet, and the disk viscosity and aspect ratio”. 
For example, a Saturn-like body located at 250 au in the circumbinary disk would 
open a gap of about 35 AU assuming standard viscosity (~ = 10”) and aspect ratio 
h(r)/r = 0.1. Under the same physical conditions, the gap would be only 12 au 
wide for a Neptune-like planet**. Observing a gap in the existing CO J = 2-1 


map would be difficult because the CO J = 2-1 line is very optically thick, making 
the expected ring/gap brightness contrast difficult to detect, except if the gap is very 
deep with a high ring/gap density contrast (larger than the CO opacity), and is large 
enough to be directly seen (about 40 au given the beam size). ‘*CO is less optically 
thick (by a factor of 70) but the existing '*CO J = 2-1 map”! has an angular reso- 
lution (0.88 X 0.56" or 120 X 80 AU) insufficient for direct imaging. However, a 
ring/outer disk density contrast of ~25 at 260 au has been reported”’. This contrast 
could be the signature of an unresolved gap in the '*CO J = 2-1 map. A spectro- 
scopic detection*” would be possible only by using very high sensitivity data, which 
do not exist so far. Very sensitive observations of optically thinner CO isotopolo- 
gues at high angular resolution (< 0.1—0.2”) are needed to study the hot spot’s 
properties (including the viewing of a gap) and its implications for planet forma- 
tion scenarios. 
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Extended Data Figure 1 | ALMA CO J= 6-5 velocity channel map. For each spectroscopic channel, the velocity is given at top left. a, Full map; b, inner zoom. 
The beam size is 0.29” X 0.25” at PA 68°. The level step is 100 mJy per beam or 3.51 K corresponding to 3.40. 
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Extended Data Figure 2 | IRAM CO J= 2-1 velocity channel map. For each spectroscopic channel, the velocity is given at top left. a, Full map; b, inner zoom. 
The beam size is 0.68” X 0.31” at PA 21°. The level step is 50 mJy per beam or 5.48 K corresponding to 3.850. 
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Extended Data Figure 3 | Montage of the CO J = 6-5 data. False colours and 
black contours show the integrated area. The velocity gradient is given in thick 
contours: blue (gas approaching), black (systemic velocity) and red (gas 
receding). Stars show the location of Aa (south) and Ab (north). The two large 
ellipses show the ring edges. The three spectra sets (y axis, intensity in units 
of Jy per beam; x axis, velocity in units of kms ') show the velocity gradient 
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along the northern/southern CO J = 6-5 clump, respectively (dominated by 
rotation). On spectra, the red line is the systemic velocity (6.4km s_'). From 
east to west, the black contours correspond to velocity contours of 6.0, 6.4 
and 6.8kms° '. The systemic velocity contour passes between the two stars 
(barycentre). The single spectrum corresponds to the location of the hotspot. 
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Extended Data Figure 4 | Dust ring best model. a, ALMA continuum data at and the best model, contour levels correspond to 2a. d, e, f, As a, b, c but for the 
0.45 mm (emission from Aa circumstellar disk has been removed). b, Best IRAM continuum data at 1.3 mm. 
model at 0.45 mm, same contour levels. c, Difference between the observations 
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Extended Data Table 1 | Parameters relevant to the analysis of the 
ALMA data 


CO J=2-1 analysis 


Systemic velocity | Visp | 6.38 + 0.02 km.s7! 
Orientation PA T+ 2° 
Inclination a 37+ 1° 


Disk outer radius | Rout | ~ 800 au 


Velocity law: V(r) = Vioo(qpom)” 


velocity at 100 au | Vio 34+ 0.1kms7! 


exponent v 05+ 0.1 


Values are adopted from previous work (see Methods). 
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Extended Data Table 2 | Best fit results for the GG Tau circumbinary dust disk, as derived from the whole continuum data set 


Geometry 
Orientation PA 6.54 0.2° 
Inclination a 39.0 + 0.2° 
Size 
Ring inner radius R; 193+ lau 
Ring outer radius R, 285+ lau 
Surface density law: ¥(r) = N200(a55) 7? 
H, surface density ~200 9+ 11074 cm~? 
exponent p 13+ 0.2 
Temperature law: T(r) = Tx00(spom) ? 
Dust Temperature T00 13.84 03K 
exponent q 11+ 0.1 
Gas-to-Dust Ratio G/D 100. - 
Absorption Coefficient (gas+dust) | k(1.3mm) | 0.02 cm’/g_ - 
Dust Spectral Index B 10+ 01 
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Quantum tomography of an electron 


T. Jullien’, P. Roulleau', B. Roche!, A. Cavanna’, Y. Jin? & D. C. Glattli! 


The complete knowledge of a quantum state allows the prediction of 
the probability of all possible measurement outcomes, a crucial step 
in quantum mechanics. It can be provided by tomographic methods’ 
which have been applied to atomic””’, molecular’, spin** and photonic 
states. For optical’° or microwave’* ’ photons, standard tomogra- 
phy is obtained by mixing the unknown state with a large-amplitude 
coherent photon field. However, for fermions such as electrons in 
condensed matter, this approach is not applicable because fermionic 
fields are limited to small amplitudes (at most one particle per state), 
and so far no determination of an electron wavefunction has been 
made. Recent proposals involving quantum conductors suggest that 
the wavefunction can be obtained by measuring the time-dependent 
current of electronic wave interferometers’ or the current noise of 
electronic Hanbury-Brown/Twiss interferometers'*’’. Here we show 
that such measurements are possible despite the extreme noise sensi- 
tivity required, and present the reconstructed wavefunction quasi- 
probability, or Wigner distribution function’’, of single electrons 
injected into a ballistic conductor. Many identical electrons are pre- 
pared in well-controlled quantum states called levitons"* by repeatedly 
applying Lorentzian voltage pulses to a contact on the conductor”. 
After passing through an electron beam splitter, the levitons are 
mixed with a weak-amplitude fermionic field formed by a coherent 
superposition of electron-hole pairs generated by a small alternat- 
ing current with a frequency that is a multiple of the voltage pulse 
frequency’’. Antibunching of the electrons and holes with the levi- 
tons at the beam splitter changes the leviton partition statistics, and 
the noise variations provide the energy density matrix elements of 
the levitons. This demonstration of quantum tomography makes 
the developing field of electron quantum optics with ballistic con- 
ductors a new test-bed for quantum information with fermions”. 
These results may find direct application in probing the entanglement 
of electron flying quantum bits”, electron decoherence” and electron 
interactions. They could also be applied to cold fermionic (or spin- 
1/2) atoms”. 

A quantum state yy contains all the information about a particle ora 
system. Disregarding spin for simplicity, a practical representation is 
given by the product of the wavefunction g which contains the spatial 
information and an occupation state in a Fock space representation. 
Although the determination of the latter requires similar experimental 
resources for a fermion and for a single boson, the complete determi- 
nation of the wavefunction by tomographic methods is fundamentally 
more demanding for fermions than for bosons. To emphasize this let 
us consider a single particle propagating in a single spatial mode (an elec- 
tron in an effectively one-dimensional ballistic conductor or a photon 
in an optical medium). In the space and time domain representation, 
the electron quantum state at position x and time t is g(t — x/vg)|f), 
where vr is the velocity, g is the wavefunction and |f) is the fermionic 
occupation state, | 1) or |0). Similarly, a photonic state is characterized 
by Eou(t — x/c)|b), where Ep is the electric field amplitude of a single 
photon, u is the spatial mode which plays the role of 9, |b) is, for exam- 
ple, a number state or a Glauber coherent state, and c is the speed of 
light. In quantum optics, the determination of the mode u can be done 


by_mixing with a coherent field (local oscillator) with amplitude 
/ NioEou(t —x/c), where No is the mean photon number. Then a 
classical measurement of u can be made because the fundamental quan- 


tum measurement uncertainty ~ 1 / No vanishes for large \/ Nio. 
Only the determination of the bosonic state |b) requires purely quan- 
tum tomography. For fermions, however, the state occupation being 
limited to 1, no classical amplitude level can be reached and a quantum 
tomography is necessary for both the wavefunction @ and the fermion 
state |f). In the present work, we prepare many electrons, each in the known 
fermionic state | f ) = |1). We show that, although extremely demanding, 
a quantum tomography of the wavefunction of a time-resolved single 
electron is possible using shot noise, and we determine the first-order 
coherence @*(é’)@(e) in the energy representation (where @(é) is the 
Fourier transform of g(t) and the asterisk denotes complex conjugation). 

We first explain how we prepare the single-electron state in a quan- 
tum conductor and then how the quantum tomography is performed. 
The quantum conductor is a quantum point contact (QPC) placed at 
low temperature (electron temperature, T, ~ 35 mK). It is formed in a 
high-mobility two-dimensional electron gas using split gates deposited 
on top of a GaAs/GaAIAs heterojunction (Fig. 1a and Supplementary 
Information). Applying negative voltage Vg to the gates creates a con- 
striction transmitting a limited number of electronic modes. Here we 
select a single mode whose transmission D( Vg) is obtained by measur- 
ing the conductance G = (2e?/h)D, where ¢ is the electron charge and h 
is Planck’s constant. The on-demand injection of charges into the con- 
ductor is obtained by applying a voltage pulse V;(¢) on one of the ohmic 
contacts adjacent to the QPC, say the left. If the voltage flux satisfies 
e Wiese V(t) dt =h then the charge is unity. Moreover, a Lorentzian pulse 
V(t) =2hw/e(t? + w’), where ft = h/2n and 2wis the pulse width at mid 
height, creates a clean single-electron state’? called a leviton'’* with an 
energy just greater than the Fermi energy of the conductor. Remarkably, 
although the leviton originates from a collective displacement of the 
Fermi sea towards positive energy, it can be considered a single-particle 
state’! disentangled from the Fermi sea*’. A wavefunction representa- 
tion of the leviton in the energy domain is @(¢) = \/2w/h0(e)e—*"/", 
where 0 is the Heaviside step function. Experimentally, periodic trains 
of levitons have been generated’*. When partitioned by a QPC"* they 
generate a current shot noise corresponding to electrons prepared in 
the occupation state |1). Using shot-noise spectroscopy and electron 
Hong-Ou-Mandel noise correlation techniques, their wavefunction 
probability in the energy and time domain was found to be consistent 
with expectations. Thus, levitons provide a clean way of testing wave- 
function quantum tomography in a quantum conductor. 

We now discuss how to probe the first-order coherence from which 
the full wavefunction can be extracted. A first approach could be wave 
interferometry. In quantum optics, this gives G,(t — t’), which is the first- 
order coherence (i (t')y/(f) averaged over the mean time f= (t+ t’) /2 
(here y* is the Hermitian conjugate of /). The information is incom- 
plete, however, and other approaches must be used”’. Wave interfero- 
metry is more promising with electrons: measuring the time-dependent 
current of an electronic Mach-Zehnder interferometer has been pro- 
posed"*as an elegant way to determine the full coherence ( (WD), 


1Service de Physique de I’Etat Condensé, IRAMIS/DSM (CNRS URA 2464), CEA Saclay, F-91191 Gif-sur-Yvette, France. 7CNRS, Laboratoire de Photonique et de Nanostructures, Route de Nozay, 91460 


Marcoussis, France. 
*These authors contributed equally to this work. 
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Figure 1 | Schematics of quantum wave tomography. a, Experiment 
principle. Periodic Lorentzian voltage pulses V(t) applied to the left contact ofa 
two-dimensional electron gas inject unit charge pulses called levitons which are 
partitioned by an electronic beam splitter called a quantum point contact 
(QPC). The split-gate voltage Vg controls the transmission D of the one- 
dimensional electronic mode formed at the QPC. A d.c. voltage Vp (not shown) 
and a weak a.c. voltage Vio(t) = (kytohvo/e)cos(2mkvo(t — t)) are applied to 
the right contact The latter generates a small flux of electrons and holes 
(red-blue wavy line) which interferes with the incoming levitons (blue bumps) 
and is analogous to the local oscillator used in quantum optics. The red and 
blue arrows indicate the directions of electron-hole and leviton excitation 
propagation, respectively. Bicolour symbols indicate the outgoing states made 
of levitons mixed with the small flux of electron and hole excitations. Measuring 
the low-frequency current shot noise while varying Vz for different harmonics 
(k = 0, 1 and 2) provides a tomographic measurement of the leviton energy 
density matrix. b, Optical analogue. The local oscillator (LO) emits a 
monochromatic photon field, here of high amplitude, which interferes in the 
beam splitter with photons emitted by the source to be analysed. The photons 
detected provide a tomographic measurement of the photonic state. 


but so far no such experiments have been done. Another approach is 
performing shot-noise measurements. This is the approach followed 
in the present experimental work. Photon shot noise has been exploited 
using adaptive homodyne tomography” to infer the unknown spectro- 
temporal properties of a single photon mode u. A large local oscillator 
field is mixed with the single photon in a beam splitter and its temporal 
shape is tuned to maximize the output noise, providing a classical copy of 
the unknown mode (Fig. 1b). For electrons, a large fermion field cannot 
be used but the principle of homodyne mixing can be kept. A recent pro- 
posal is to reverse the amplitude hierarchy'®. Because it is the single-electron 
source corresponding to the maximum fermionic amplitude (as |f) = |1)), 
this is the local oscillator fermionic probe source whose amplitude has 
to be weak so that simple information can be provided by the noise var- 
iation at the beam-splitter output when mixing particles coming from 
the two sources. For simplicity, let us consider zero temperature. When 
a flux-quantized Lorentzian voltage pulse is applied to the left contact, 
a leviton is sent towards the QPC beam splitter. Repeating the experi- 
ment and recording the charge at the right contact gives the mean charge 
(Q) = De and the charge partition noise (AQ’) = D(1 — D)e”. Now let 
us apply a d.c. voltage Vp to the right contact, rising the electrochemical 
potential by eVg. The part of the leviton partition noise proportional to 


eVR ~ 


> | 9) G(e)de, corresponding to the energy range ¢ € [0, eVgl, is 


604 | NATURE | VOL 514 | 30 OCTOBER 2014 


b d 


a, % J eee (™) 


300 K 


i eR oy les 
I a | 
I I I y 
es ee | I y 
WWW p WUT é 13 mK 
35 Bias tee aPC Bias tee = 


c, Experiment principle in energy representation. All quantities are step 
functions of the energy with energy scale hv. Left: the energy dependence of the 
wavefunction of periodic levitons emitted from the left contact. Right: the 
energy distribution of the right reservoir when the voltage Vg + Vio(¢) is 
applied. For small 7,0, an electron emitted with energy ¢ arrives at the QPC 
in a superposition of states of energy ¢ + khyp. Its interference with the 
levitons changes the leviton partition noise by a quantity proportional to 

No (P(e) (e+ khvo) — @(e—khvo)@(e)), giving information on the 
non-diagonal energy density matrix. fz = |g(e)|?. d, The 6 GHz periodic 
Lorentzian voltage pulses are obtained from a frequency synthesizer by 
combining four harmonics with appropriate phases and amplitudes and sent to 
the left ohmic contact of the sample via 40 GHz d.c. transmission lines. To 
generate V;9 harmonics 1 and 2 are derived from the synthesizer, attenuated 
and sent to the right contact. A time delay is added using a phase shifter. Bias 
tees separate the high-frequency components from the d.c. voltage bias and the 
detected low-frequency current noise. The latter is converted into voltage 
fluctuations. A computer performs fast Fourier transform cross-correlations 
after cryogenic amplification and fast acquisition. Appropriate d.c. voltage 
biases are applied to the left and right contacts. 


replaced by the partition noise of the right electrons reduced by the 
factor 1— {* . @(€)*@(e)de because of antibunching. Differentiating 
the noise with respect to Vp provides the diagonal part of the energy 


density matrix of the leviton |9(¢ = eVg)|’. Remarkably, very little modi- 


fication is necessary to access the non-diagonal part (y*(e' A e)(e)). 
Let us superimpose on the right d.c. voltage the a.c. voltage Vio(t) = 
nio(hv/e)cos(2nv(t — t)) with small amplitude 7,9 < 1, frequency v 
and time delay t referred to the leviton emission time. This generates 
at the QPC a weak fermionic field made of electrons in a superposition 
of states with their initial energies shifted by +hv and with probability 
amplitudes 9, 9¢”"""/2 and — oe ”*""/2, respectively, the probability 
of keeping the initial energy being 1 — n,o ~ 1 (Fig. 1c). Their mixing 
with the levitons at the beam splitter gives the antibunching probability 
amplitude 


i2MVTt 


a | 
a(nio.é) =9" () (1 No’) 5? (e+hv)moe 


—i2nvt 


Vey 
= 3? (e—hv) moe 


Differentiating the noise with respect to eVp, and keeping only the a.c. 
contribution, gives a noise variation proportional to 
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NLoRe[P* (eV) @(eVR+hv)e?™" 
— 9 (eV) @(eVR —hv)e?™""] 


Fully quantum tomography can then be done by varying the voltage 
frequency and time delay t, providing the real and imaginary parts of 
the first-order coherence’®. Because the leviton wavefunction in the 
energy domain is real, we get 


ae CNLo cos(2mvt)[9*(eVr) @(eVR+hv) ) 


— 9 (eVr)@(eVe —hy)] 


Because only positive energy is concerned, equation (1) gives @(eVr)@ 
(eV. + hy) for 0< eV <hv,9(eVn)@(eVa + hv) —9(eVa —hv)O(eVe) 
for hv = eVp < 2hv and so on, providing a full determination of the 
energy density matrix. 

In our experiment, single-electron levitons are injected at a fre- 
quency Vo by applying periodic Lorentzian voltage pulses from the left 
contact. The full-width at mid-height is 2w = 30 ps and the period is 
T= vo |= 166ps. To extract the diagonal and off-diagonal parts of the 
energy density matrix, we follow the experimental approach discussed 
above with current noise measured instead of charge fluctuations. With 
Vp = 0 and in the absence of a.c. voltage, the partitioning of levitons 
generates the low-frequency current noise spectral density S?(0) = 4¥9 
(AQ’) = 4ve”D(1 — D) where double counting of the mode from spin 
degeneracy is included. Because of the periodicity, (w* («')(e)) = 
>, 0(e! —e—khvo) @* (e+ khvo) @(e), where 6 is the Dirac delta func- 
tion and k is an integer. Only energies separated by multiple of hvo 
contribute to off-diagonal terms, and the frequency of V;,(t) must be 
v = +kypo. Because high harmonics give too small a signal, measurements 
are restricted to k = 0, 1 and 2. 

The diagonal part (k = 0) is obtained by measuring the difference 
between the partition noise of single-electron levitons injected at fre- 
quency vo, using the periodic Lorentzian voltage V;(t), and that of con- 
tinuously injected electrons provided by a constant voltage equal to the 
mean value of the voltage pulse V(t) =hvg/e. The zero-temperature 
noise difference is'** 


Lp oO 
_ So 2s 2 
ASt=un)= FE ( | (1—B6)*)de— | $(6)" de— lay —ve (2) 
0 LR 


On the right-hand side, the first term represents the noise of right elec- 
trons emitted at energy below Lg = eVp, whose shot noise is reduced 
by antibunching with the leviton. The second term is the partition noise 
of levitons for energy above jig. It is followed by the subtracted d.c. shot 
noise. Because g(¢) varies by steps in hyo intervals, equation (2) dis- 
plays linear variation by parts. Introducing the notation o(e) =p 1, for 
lhvg <é< (1+ 1)hyo, this yields 0 for Lug < 0, (2(1 — poo) ta + ho — Lp)! 
hvo for 0 < UR < hyo, 2(1 — Poo) — 201,1(LlR — hVo)/hVo for hvo < LR 
<2hvo, and so on. The finite electron temperature T, ~ 35 mK ~ 
0.12hvo/kg smoothens the singularities separating linear variations. A 
finite-temperature expression, obtained in a Floquet scattering approach”, 
is given in Supplementary Information. Figure 2a shows measurements 
of Ask ~ © versus Ltr for levitons (open blue circles). From the data we 
extract p,, directly from a fit including the small-temperature effects. 
The solid red curve corresponds to the best four-parameter fit, from 
which the four first p;; (that is, ] = 0 to 3) are extracted (Supplementary 
Information). Self-heating including a small temperature increase (35 mK 
for Vp ~ 0 to 47 mK for Vp ~ 3hvo/e) has been included. The p;, are 
plotted in red in Fig. 2b. Comparison with theoretical values based on a 
Floquet scattering modelling (in black) is good within the experimental 
uncertainties. 

For the measurement of the off-diagonal part (k #0), we mea- 
sure the noise difference when switching on and off the amplitude of 
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Figure 2 | Measurement of the diagonal part of the energy density matrix. 
a, The data (blue circles) display the shot noise measured when applying a d.c. 
voltage hvo/e, minus the shot noise measured when sending single-charge 
levitons at frequency vo. The data are plotted versus the d.c. voltage Vp applied 
on the opposite contact. The red curve represents the best fit to equation (2), 
including finite temperature, from which the parameters pj) are extracted. The 
temperature T, is ~35 mK at Vp = 0 and ~48 mK at Vp = 3hyo/e. The 
transmission is D = 0.19 and S,° = 9.48 X 10-7? A? Hz. Error bars, s.e.m. 
calculated from the cross-correlation noise spectrum standard deviation in a 
800 kHz bandwidth and after n ~ 100,000 acquisitions. b, Diagonal part of the 
energy density matrix versus energy using the p;, extracted from the fit. 
Comparison with theory for levitons is in black. Error bars, s.e. calculated from 
the best fit. 


Violt) = niro(hv/e)cos(2nv(t — t)) superimposed on the right d.c. 
voltage Vp and synchronized with the Lorentzian pulses. The zero- 
temperature shot noise difference is 


0 


S 
ASK (tty) = favg 2AML0008(21tkv07) 


HR (3) 
x | ((e)H( e+ Khve) —G(e)H(e— Kho) a 


0 
This expression also varies linearly by parts because @(¢)@(¢+ khvo) 
= Pix for lhvo <é< (I+ Iho is a step function of the energy. For 
example, for k = 1 the integral in equation (3) is 0 for Ug <0; Po,1LR/ 
hvo for 0 < Ur< hyo; Po. — P1,2(UR — AVo)/hVo for hvo < Ug < 2hvo; 
and so on. Finite-temperature expressions are given in Supplementary 
Information. 

Figure 3a, b shows measurements of ASE versus t for k = 1, NLo 
= 0.095 and k = 2, nyo = 0.0425 at fixed pp ~ hypo. Clear oscillations 
are observed. The delay period in Fig. 3b is twice that observed in Fig. 3a. 
Before going further, we emphasize that observation of these oscilla- 
tions demonstrates off-diagonal coherence in an electronic orbital state, 
here a leviton. No experiment has given direct evidence of off-diagonal 
coherence of electrons in a conductor yet. We now compare the off- 
diagonal terms extracted from the measurements with those predicted 
for a leviton. Figure 3c, d shows the experimental variation of AS. 1 with 
Ltr for k = 1 and 2 (open blue circles), fixing t = 0 to maximize the 
signal. In both plots, the solid black curves represent the theoretical 
signal expected for a leviton. The red dashed curves are the best fits, 
from which the off-diagonal terms 4; can be extracted. For k =2a 
four-parameter fit is used and / = 0 to 3, and for k = 1 a five-parameter 
fit is used because a finite, but very small, hole excitation contribution 
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Figure 3 | Off-diagonal part of energy density 
matrix. a, b, Weak sine-wave voltages of 
frequencies and amplitudes vg and 7,9 = 0.095 
(a) and 2v9 and 7,9 = 0.0425 (b) superimposed on 
a fixed d.c. voltage eVg = hv applied on the right 
contact. The data (blue circles) show the noise 
difference obtained by switching on and off the 
sine-wave amplitude, plotted versus the time delay 
t between the sine-wave and the periodic 
Lorentzian pulses. The oscillation of the noise with 
t and the period doubled in b relative to a is the 
signature of off-diagonal coherence. The red curves 
are sinusoidal fits. c, d, Same shot noise difference 
measurements (blue circles) but plotted versus 
the d.c. right voltage with t = 0, chosen to 
maximize the signal. T. is ~35 mK at Vp = 0 and 
~48 mK at Vp = 3hvo/e. The red curves are the best 
fit of equation (3) with p;)+, as free parameters, 
including finite-temperature effects, with k = 1 (c) 
and k = 2 (d). The black curves are comparison 
with theory. Error bars, s.e.m. calculated as in 

Fig. 2a but with four times larger. e, f, Plot of 
9(€)0(e+hkyo) (red) using the parameters p+ 
extracted from the best fits of c and d with 

k= 1 (e) andk =2 (f), respectively. Expected 
values from theory are shown in black. Error bars, 
s.e. calculated from the best fit. 


Figure 4 | Wigner function and leviton 
wavefunction in the time domain. a, The zero- 
temperature Wigner distribution function of the 
levitons is reconstructed from the );+ values 
obtained from tomography. The WDF is limited to 
the first two temporal harmonics and to the 
energy domain & < 3.5hvo. Negative parts are 
characteristic of a wavefunction which cannot be 
described by a classical probability distribution in 
the time and energy domains. b, Theoretical WDF. 
The full WDF has been truncated to the first 

two harmonics and to the energy range probed 
experimentally, for better comparison. The 
two-dimensional colour plots are shown to 

better display the region of negative values of the 
WDE. c, Cut of the experimental and theoretical 
WDEFs at energy ¢ = 1.25hyo (for energies 

1.5hV9 > & > hyp both the first and second 
harmonics contribute to the WDF). Using the 
experimental errors in Fig. 3e, f, we have generated 
the two red dashed curves between which the 
experimental WDF is expected to lie. Robust 
negative values of the WDF are found, a hallmark 
of interference effects. d, Projecting the WDF along 
the energy direction gives access to the temporal 
variation of the electron probability distribution 
of the leviton |g(¢)|”. The experimental data are 
(red) are compared with the theoretical truncated 
WDE (black). 


(1= —1) was found necessary to improve the fit. They are shown in 
Fig. 3e, f. The errors bars are obtained from the standard error of the 
fit. They are much larger than those in Fig. 2b because the measured 
noise is about ten times smaller. 

The noise oscillations with phase in Fig. 3a, b and the noise variations 
with voltage in Fig. 3c, d are well reproduced by a model including only 
the 35-50 mK thermal rounding. Supplementary Fig. 2 provides addi- 
tional tomographic noise measurements performed on two-electron 
sine-wave pulses. Here again good agreement is found between data 
and theory. Deviation arising from decoherence effects due to electron- 
electron interaction is not expected here because the two-dimensional 
Landau quasi-particle lifetime is longer than the (50 mK) thermal time 
for the typical energies probed here (Supplementary Information). 

From the experimental values of 9(¢)@(¢+ khvg) obtained above we 
can now reconstruct part of the zero-temperature Wigner distribution 
function” (WDF) of the levitons: W(t,¢) = | : (Wt (e+6/2)W(e- 
5/2))e—*/" dé, The WDF gives the full information on the complex 
wavefunction. It has been calculated for a single Lorentzian pulse’* and 
for periodic pulses'’. In the latter case, the WDF is also periodic. Its 
expression in terms of the quantities 01); is given in Supplementary 
Information. Figure 4a and Fig. 4b respectively give three-dimensional 
plots of the experimental and theoretical WDFs. Because the experi- 
mental WDF is limited to the first two harmonics and the energy range 
—hvo = & < 3.5hy9, the theoretical WDF is truncated accordingly for 
comparison. To better show the agreement between experiment and 
theory, we provide in Fig. 4c a cut of the WDF at energy ¢ = 1.25hv9 
(red, experiment; black, theory). Using the experimental errors of 01+ 
(1 = 0) in Fig. 3e, f, we have generated the two red dashed curves in 
Fig. 4c between which the experimental WDF is expected to be located. 
For reduced times t/T around ~0.3 and 0.7 we clearly find robust regions 
of negative WDF, a hallmark of a quantum state. 

Finally, as a particular application of the WDF, the integration over 
energy ¢ gives the probability |¢(t)|. Its experimental (red solid curve) 
and theoretical (black dashed curve) variations with time are given in 
Fig. 4d. The excellent agreement validates our experimental determina- 
tion of the WDF from the tomographic noise procedure of ref. 16. This 
provides better information on the temporal shape of the wavefunction 
than the convolution |(g(t)|g(t + t))|? given by the electronic Hong- 
Ou-Mandel method used for levitons in ref. 18 or for single electrons 
in ref. 31. 

Using the known leviton state, we have demonstrated that the quan- 
tum wave tomography of itinerant electrons can be done experimentally 
using shot noise. Probing the off-diagonal energy density matrix opens 
the possibility of studying more complex electron quantum states such 
as entangled electron quantum bits in ballistic conductors or complex 
electron quantum states built by interaction. Examples of the latter include 
Kondo electronic states, fractional quantum Hall quasi-particles or elec- 
tron Andreev pairs built by superconducting proximity effects. 
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Room-temperature magnetic order on zigzag edges 
of narrow graphene nanoribbons 
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The possibility that non-magnetic materials such as carbon could 
exhibit a novel type of s—p electron magnetism has attracted much 
attention over the years, not least because such magnetic order is pre- 
dicted to be stable at high temperatures’. It has been demonstrated 
that atomic-scale structural defects of graphene can host unpaired 
spins”’, but it remains unclear under what conditions long-range mag- 
netic order can emerge from such defect-bound magnetic moments. 
Here we propose that, in contrast to random defect distributions, 
atomic-scale engineering of graphene edges with specific crystallo- 
graphic orientation—comprising edge atoms from only one sub-lattice 
of the bipartite graphene lattice—can give rise to a robust magnetic 
order. We use a nanofabrication technique* based on scanning tun- 
nelling microscopy to define graphene nanoribbons with nanometre 
precision and well-defined crystallographic edge orientations. Al- 
though so-called ‘armchair’ ribbons display quantum confinement 
gaps, ribbons with the ‘zigzag’ edge structure that are narrower than 
7 nanometres exhibit an electronic bandgap of about 0.2-0.3 elec- 
tronvolts, which can be identified as a signature of interaction-induced 
spin ordering along their edges. Moreover, upon increasing the rib- 
bon width, a semiconductor-to-metal transition is revealed, indica- 
ting the switching of the magnetic coupling between opposite ribbon 
edges from the antiferromagnetic to the ferromagnetic configura- 
tion. We found that the magnetic order on graphene edges of con- 
trolled zigzag orientation can be stable even at room temperature, 
raising hopes of graphene-based spintronic devices operating under 
ambient conditions. 

The intrinsic magnetism of graphite has a long and controversial 
history’. The origin of the measured magnetic signal is generally attrib- 
uted to atomic-scale structural defects locally breaking the sub-lattice 
balance of the bipartite hexagonal lattice**. However, unambiguous iden- 
tification of the structural sources of the measured magnetic signal has 
proved challenging because the sources are buried inside the bulk of the 
material. The isolation of single graphene layers’ opens up new pro- 
spects in this direction*” because their atomic structure is fully access- 
ible for imaging and controlled modification. In particular, the graphene 
edges of a specific crystallographic orientation comprising carbon atoms 
from only one sub-lattice of the bipartite hexagonal lattice (zigzag) are 
predicted to host magnetic order"®, in striking contrast to graphene edges 
that incorporate an equal number of carbon atoms from both sub-lattices 
(armchair). 

The strong influence of edge orientation on the electronic structure 
of graphene nanoribbons was predicted” long before graphene was 
isolated’. However, a systematic experimental verification of this fun- 
damental prediction is still lacking, owing to very limited control over 
the edge orientation of the graphene ribbons investigated. Opening of a 
gap inversely proportional to the ribbon width has been experiment- 
ally demonstrated by electrical transport measurements'*”’; however, 
no evidence on the effect of edge orientation has been revealed. This is 
attributed mainly to the random edge orientations and the presence of 


a considerable, nanometre-scale edge roughness, suppressing orienta- 
tion effects'*. Scanning tunnelling microscopy (STM) study of irregu- 
larly shaped graphene ribbons revealed that structures with more zigzag 
edges display smaller bandgaps than those with more armchair edges"». 
This is clearly indicative of edge-specific physics; however, the lack of 
orientation control did not allow a more systematic insight. In theory, 
there is a broad consensus that graphene nanoribbons with armchair 
edges are semiconductors, owing to the quantum confinement of the 
charge carriers, while zigzag ribbons host peculiar flat-band edge states’”’*. 
In the picture neglecting electron-electron interactions, these edge states 
localized near the Fermi level render all zigzag graphene nanoribbons 
metallic. However, such one-dimensional metallic edge states with a high 
local density of states at the Fermi level become unstable upon electron— 
electron interactions. To lower the energy of the system, it is favourable 
to split the flat band (open a gap) by ordering the spins along the two 
ribbon edges with antiferromagnetic coupling between opposite edges. 
Consequently, the emergence of magnetic order is closely linked to al- 
tering the electronic structure of the ribbons, through opening a band- 
gap in the otherwise metallic zigzag nanoribbons'*’®'”. This enables us 
to detect the signature of edge magnetism on individual graphene nano- 
structures by investigating their electronic structure (directly measuring 
magnetic signals would require a macroscopic amount of such ribbons). 
This edge-state magnetism and the associated bandgap opening in zig- 
zag ribbons is consistently predicted by various theoretical models, 
including first principles density functional theory'’, and mean-field- 
theory-based Hubbard’* and quantum Monte Carlo calculations”, indi- 
cating that edge magnetism is a robust property of zigzag graphene 
nanoribbons, not sensitive to the specific details of the models. How- 
ever, the stability of the magnetic order on real graphene edges and 
experimental conditions is strongly debated. Experimental indications 
that edge magnetism can indeed occur at low temperatures (7 K) have 
been provided by tunnelling spectroscopy measurements on graphene 
ribbons obtained by unzipping carbon nanotubes”’. However, the ran- 
dom orientation of the edges and the influence ofa possible strong edge- 
substrate hybridization’ did not allow edge magnetism in graphene to 
be properly characterized. Although the magnetic order is expected to 
persist to some extent on zigzag segments of randomly oriented gra- 
phene edges, the mixing of different edge types is expected to substan- 
tially weaken the effect'’”’. Therefore, the lack of experimental control 
over edge orientation seems to be one of the main reasons that the mag- 
netic graphene edge states consistently predicted by various theoretical 
models have remained experimentally so elusive. 

To realize graphene nanoribbons with precisely controlled crystal- 
lographic edge orientations, we used a nanofabrication technique based 
on STM*. In atomic-resolution images, the crystallographic directions 
of the graphene lattice can be identified and matched with the desired 
cutting directions. The direct cutting of graphene is done by beam- 
induced chemical etching activated by the sub-nanometre-wide chan- 
nel of tunnelling electrons, locally breaking the carbon-carbon bonds 
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Figure 1 | Fabrication of graphene nanoribbons with precisely defined 
crystallographic edge orientations. STM image (500 mV, 0.8 nA) of a 5-nm- 
wide graphene nanoribbon with armchair edge orientation (a) and STM image 
of a 6.5-nm-wide ribbon with edges of precisely zigzag orientation (300 mV, 
2nA) (b) patterned by scanning tunnelling lithography in a graphene sheet 


underneath the atomically sharp STM tip apex. More details on the 
nanolithographic process are given in the Methods. 

Using the STM nanolithographic technique, we defined nanoribbons 
into graphene sheets grown by chemical vapour deposition and trans- 
ferred onto gold (Au) substrates with large Au (111) terraces. Au (111) 
has been predicted to be one of the few substrates preserving edge mag- 
netism in supported graphene ribbons”’. Nanoribbons with pre-defined 
armchair or zigzag edge orientation and widths ranging down to 3 nm 
were defined (Fig. 1). The as-fabricated ribbons display regular edges 
of subnanometre roughness. The bright protrusions near the edges in 
Fig. 1 are due to a known imaging instability”, rather than edge defects 
or impurities. Upon optimizing the STM imaging, we found the edges 
defined by STM lithography to be straight and close to atomically smooth, 
free of detectable impurities, reconstructions or curvature (Extended 
Data Fig. 1). 

Tunnelling spectra acquired on armchair ribbons displayed semi- 
conducting current-voltage (I-V) characteristics, in contrast to spec- 
tra taken on graphene outside the ribbons (see Extended Data Fig. 2). 
The measured bandgap asa function of the corresponding ribbon width 
is plotted in Fig. 2a. A clear inverse proportionality is revealed, in ex- 
cellent quantitative agreement with our calculations based on the Hub- 
bard model (equation (1)) considering hydrogen-saturated and relaxed 
edges, as well as first-principles DFT calculations’’. The theoretical data 
points correspond to three classes of armchair ribbons: with 3n, 3n + 1 
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deposited on a Au(111) substrate. The circular insets show atomic-resolution 
STM images, confirming the crystallographic directions of the edges. The 
atomic-resolution images in the insets were Fourier filtered for clarity. The 
protrusions on the otherwise highly regular edges are imaging instabilities. 


and 3n + 2 (forn = 1,2, 3,...) rows of carbon dimers across their width. 
The measurement error and the emerging atomic-scale edge irregular- 
ities do not allow us to distinguish clearly between ribbons belonging to 
the 3n and 3n + 1 classes; however, armchair ribbon segments with no 
detectable gap have been observed in a few cases, which probably belong 
to the 3n + 2 class. Consequently, our measurements systematically 
verify the predicted mechanism of bandgap opening caused by the lat- 
eral confinement of the charge carriers in ribbons of precisely armchair 
edge orientation. 

A strikingly different behaviour is revealed in zigzag graphene nano- 
ribbons (Fig. 2b). For zigzag graphene nanoribbons narrower than 7 nm, 
we observed a fairly large bandgap of about 200-300 meV. As discussed 
above, the broadly predicted origin of bandgap opening in zigzag ribbons 
is electron—electron interactions and the predicted many-body ground 
state implies that the edges are magnetized'*"*”. 

However, in contrast to first principles theoretical predictions"®, the 
measured gap suddenly vanishes for zigzag ribbons wider than 8 nm. 
Weattribute this discrepancy to the fact that these calculations were per- 
formed at zero temperature and without doping, whereas the experi- 
mental data was acquired at room temperature and finite doping (the 
corresponding shift of the Fermi level AEy is about 50-100 meV; see 
Extended Data Fig. 2c). When the effects of temperature and doping 
were not taken into consideration, we found the ground state to be always 
semiconducting (antiferromagnetic) and no steep transition to a metallic 
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Figure 2 | Edge-specific electronic and magnetic properties of graphene 
nanoribbons. The bandgap measured by tunnelling spectroscopy as a function 
of ribbon width in armchair (a) and zigzag (b) ribbons. Armchair ribbons 
display a quantum confinement gap inversely proportional to their width 
(where n = 1, 2, 3,...; see main text). In zigzag ribbons the band structure is 
governed by the emerging edge magnetism and a sharp semiconductor 


(antiferromagnetic) to metal (ferromagnetic) transition is revealed. Theoretical 
data points were calculated using the mean field Hubbard model (continuous 
lines are only guides to the eye). Error bars of the measured gap values 
originate from thermal broadening and substrate effects; error bars for ribbon 
widths (horizontal) arise from tip convolution effects. 
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rr. #8 Figure 3 | Correlating electronic and magnetic 
—< -8 ® properties of zigzag graphene nanoribbons. Spin 
= pK p< density distribution (7, blue; |, red) in 5-nm-wide 
: ‘ i a 2 (left) and 10-nm-wide (right) zigzag graphene 
Lf \ Wd e nanoribbons calculated in the mean field Hubbard 
Ne ye model for T = 300 K and AE; ~ 100 meV. The 


lower panels display the corresponding band 
structure, clearly indicating that narrow zigzag 


ribbons are antiferromagnetic semiconductors, 
whereas the wider (>8 nm) zigzag ribbons display 
a ferromagnetic inter-edge coupling and no 
bandgap. ka is wavenumber times the lattice 


constant. 
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state is expected. To understand the origin of the experimentally observed, 
sharp semiconductor-metal transition, we performed calculations based 
on the mean-field approximation of the Hubbard Hamiltonian that 
accounts for both finite temperature and doping: 


H=— DY tyLget+ UD Ant — aN (1) 
Ly J 


where the first term is the nearest-neighbour tight-binding Hamilto- 
nian, and the second term stands for the onsite Coulomb repulsion. The 
effects of temperature and doping have been included using the grand 
canonical ensemble (third term), where j1 is the chemical potential, and 
Nis the particle number operator. The electron density and the chem- 
ical potential were determined self-consistently. (For more details on 
calculations see the Methods.) 

The main result of our extended theoretical model is that it repro- 
duces the observed steep semiconductor—metal transition, quantitatively 
accounting for both the critical ribbon width (about 7 nm) at which the 
transition occurs and the magnitude of the measured bandgaps, using 
the strength of the on-site repulsion Uas the single unknown parameter 
(Fig. 2b). By calculating the spin density distribution for both semicon- 
ducting (<7 nm wide) and metallic (>8 nm wide) ribbons the origin 
of the observed semiconductor-metal transition can be identified as a 
transition from an antiferromagnetic (semiconducting) state, where 
magnetic moments on opposite ribbon edges are aligned antiparallel, 
to a ferromagnetic (metallic) state, with parallel spin alignment on op- 
posite edges (Fig. 3). The metallic nature of zigzag graphene nanoribbons 
with ferromagnetic inter-edge coupling has already been predicted"*, 
together with a complex magnetic phase diagram as a function of dop- 
ing. Our theoretical results are consistent with these findings in the 
T— 0K limit. We note that the observation of the gap opening alone 
in zigzag graphene nanoribbons would not unambiguously indicate 
edge magnetism, given that reconstructed zigzag edge configurations 
(for example, Z21,) leading to semiconducting but non-magnetic zig- 
zag graphene nanoribbons have been predicted”*. However, only mag- 
netic zigzag edges can account for both gap opening and the observed 
semiconductor-metal transition. This, together with the quantitative 
agreement between our calculations and experiments provide indirect 
but compelling evidence that magnetic order emerges on zigzag graphene 
edges of precisely engineered crystallographic orientation. Moreover, it 
is remarkable that the signature of edge magnetism can be experiment- 
ally detected at room temperature, although this is in agreement with 
the expected high-temperature stability of s-p electron magnetism**”*, 
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as well as experiments reporting hints of room-temperature magnetism 
in defective graphitic samples*”””. 

Since the only free parameter used in the calculations for zigzag 
ribbons is the magnitude of the on-site repulsion U, the best quant- 
itative agreement with our experimental findings provides an experi- 
mental estimate for the strength of the electron-electron interaction in 
graphene of U = 3.24 eV. Theoretical works have predicted the mag- 
nitude of U to range from 2 eV to 6 eV (see, for example, ref. 30). The 
experimentally estimated value of U is even more reliable, because a 
single U value can reproduce both the magnitude of the gaps and the 
semiconductor-metal transition ribbon width. 

That the experimental results can be quantitatively interpreted by 
theoretical calculations on perfect zigzag (or armchair) edges confirms 
the experimentally found high edge quality. However, deviations from 
the ideal edge structure can be present on the atomic scale. To estimate 
the effect of atomic-scale edge irregularities, we performed calculations 
ona model system having a high density of such defects (Extended Data 
Fig. 3). The results indicate that, even in the presence of a high defect 
density, for overall zigzag oriented edges the qualitative picture holds 
but the strength of the effect (gap size, spin polarization) decreases. To 
fit the experimental data, the value of electron-electron interactions 
estimated based on the ideal zigzag edge has to be increased from U = 
3.24eV to U=4.32 eV. 

Our findings demonstrate that graphene nanoribbons display strongly 
edge-orientation-specific behaviour and that engineering the crystallo- 
graphic orientation of graphene edges allows us unprecedented control 
over both electronic and magnetic properties of graphene nanostruc- 
tures. This opens the way towards the realization of electronic and spin- 
tronic devices that are based on robust quantum mechanical effects and 
that can operate at room temperature. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


The graphene samples were grown by chemical vapour deposition on Cu foil, and 
transferred to a stripped gold substrate with large, atomically flat Au (111) terraces. 
The lithographic process and tunnelling microscopy/spectroscopy investigations 
were performed using a Nanoscope E Scanning Tunneling Microscope operating 
under ambient conditions. First, atomic-resolution STM images of the graphene 
lattice were acquired (bias voltage Uji,s = 5-50 mV and tunnelling current Ijunnel = 
1-2nA), to precisely identify the zigzag and armchair directions of the graphene 
lattice. For patterning, a bias voltage of 2.0-2.3 V (tip negative) is applied between 
the tip (90% Pt, 10% Ir) and the sample while the tip is slowly (1-5 nms_') moved 
along the desired cutting direction. The humidity of the cutting atmosphere (70%- 
75%) can be precisely controlled inside an atmospheric hood. Before the tunnelling 
spectroscopy measurements on ribbons, tunnelling spectra were acquired on gold 
to ensure that the spectra were linear and confirm that the tip was good quality and 
contamination-free. We acquired tunnelling spectra inside the ribbons to confirm 
the presence or absence of a bandgap. This method has proved more efficient and 
reproducible under ambient conditions than measuring strongly edge-localized peaks 
in the dI/dV spectra. We define the gap of the ribbon as the width of the plateau 
around zero bias in tunnelling spectra (Extended Data Fig. 2b). These plateaus are 
not entirely flat, but have a shallow slope, which we attribute to the presence of gold 
substrate underneath the atomically thin graphene nanostructures. The uncertainty 
(error bars) in the measured gap values mainly comes from the thermal broadening 
and the influence of the substrate, while for ribbon widths the uncertainty is due 
to the STM tip convolution effects. As control experiments, both before and after 
measuring the J-V characteristics of a ribbon, tunnelling spectra were acquired on 
the graphene sheet outside the ribbons. The (nonlinear) spectra measured on the 
ribbons were only taken as reliable if both before and after their measurement, the 
spectra of the unpatterned graphene displayed the expected closely linear I- V char- 
acteristics. The tunnelling I-V characteristics of graphene and metallic graphene 
nanoribbons on gold substrate are close to linear owing to the contribution of the 
high local density of state of the Au substrate. Within (along) the ribbons the mea- 
sured spectra were fairly homogeneous (Extended Data Fig. 4), apart from a few 
exceptions, which we attribute to local edge defects or impurities. 

To interpret the experimental results we considered the Hamiltonian of equa- 
tion (1). We applied the tig = fig — <ftig > + <i > identity and neglected the 
term that contains the fluctuations. Quantum Monte Carlo simulations including 
fluctuations found the agreement between mean-field theory and quantum Monte 
Carlo to be remarkably accurate for moderate Coulomb interactions, justifying the 
application of mean-field theory for the description of realistic ribbon geometries*’. 
Wearrived ata single-particle problem, which can be diagonalized by a generalized 
Bogoliubov transformation in k space: 


Hmean field = De (Ens bh) Ch ke Cnko —U > < aj o< nj = 
where C ka 
ticle with wavenumber k and spin o in band n. The energy bands are given by énkc, 
which depends on the yet unknown electron densities and chemical potential. The 
densities are calculated self-consistently, while the chemical potential is determined 
by the conservation of the number of particles. The self-consistent iteration was stopped 
when the difference between the electron densities was smaller than 10~ *. Itis known 
from the Hubbard-model theory that the half-filled case shows antiferromagnetic 


(Creo) are the transformed operators that destroy (or create) a par- 


correlations for U> 0, and that at a certain U- a Mott transition can take place. 
However, in the doped case both antiferromagnetic and ferromagnetic ‘ground’ states 
can be obtained. The solution which has the lowest free energy was accepted as the 
‘ground’ state. 

To provide accurate bandgap values for the graphene nanoribbons, we first cal- 
culated the relaxed edge geometries with hydrogen passivation of narrow (2 nm) zig- 
zag and armchair ribbons. We used density-functional-theory-based molecular 
dynamics calculations within the framework of local density approximation using 
the VASP»? simulation package. With the help of the relaxed edge coordinates of 
the atoms we were able to parameterize the tight-binding hopping elements in our 
Hubbard model. These hopping elements were applied to ribbons of various widths. 

In the VASP calculations projector augmented wave pseudo-potentials**** were 
used and the kinetic energy cut-off for the plane wave expansion was 400 eV. In all 
ribbon geometries the atomic positions were relaxed using the conjugate-gradient 
method until the forces of the atoms were reduced to 0.02 eV A '. Weused a rectan- 
gular super-cell with dx = 40 A in the x direction and dy = 2.46 A in the y direction 
for the zigzag graphene nanoribbon (4.26 A for the armchair graphene nanoribbon). 
The Brillouin zone was sampled using 2 k-points along the x axis and approximately 
8/dy k-points along the y axis (where dy is in nanometres). Vacuum layers of 20 A 
in the ribbon plane and 24.6 A in the normal direction were applied to avoid in- 
teractions between nanoribbons in different unit cells. 

From the relaxed carbon-carbon distances, we computed the hopping ampli- 
tudes using the parameterization”® 

“')| 


ry \ 
= (8) “orn 
where r; is the distance between the atoms i and j, ay = 1.42 A (the carbon-carbon 
distance in the bulk), «2 = 1.2785, «3 = 0.1383 and «4 = 3.4490. 

To estimate the maximum possible quantitative effect of edge irregularities on 
the magnetism, we performed the calculations presented here on a model system 
with edges of overall zigzag orientation, but containing the maximum amount of 
atomic-scale edge defects that does not completely destroy edge magnetism (Extended 
Data Fig. 3). We found that the qualitative picture of edge magnetism including the 
semiconducting (antiferromagnetic) to metallic (ferromagnetic) transition holds true 
for the defective zigzag edges, but the effect is substantially weakened compared to 
perfect edges. In particular, the calculated gap values are reduced. Therefore, the 
best fit to the experimental data occurs for a higher parameter value of the onsite 
repulsion parameter of U = 4.32 eV. Also, the width dependence of the gap becomes 
weaker, but this is still in good agreement with the measurements (Extended Data 
Fig. 3b) for the increased U parameter value. 
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Extended Data Figure 1 | The structure of graphene edges defined by STM —_ under specific imaging conditions (200 mV, 2 nA) can be attributed to the 
lithography. a, STM image (5 mV, 2 nA) of 15-nm-long edge segments cut presence of edge states that rule out the possibility of sp°-type edge terminations 
by STM lithography, revealing edges that are close to atomically smooth (for example, di-hydrogenated edges), because no edge states are expected 
(<5 A edge roughness) free of detectable reconstructions, contaminations or __ to occur for such edge configurations. 

curvature. b, The increased local density of states on zigzag edges observed 
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Extended Data Figure 2 | Scanning tunnelling spectroscopy of graphene 
nanoribbons on Au (111). Tunnelling I-V characteristics acquired on a 5-nm- 
wide armchair ribbon (a) displaying nonlinear I-V spectra corresponding to a 
gap of about 250 meV (b). Outside the ribbon a close-to-linear characteristic 
of the unpatterned graphene is revealed (c). The blue triangle and yellow square 
mark the positions of the corresponding tunnelling spectra. The insets show 
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the schematics of STM lithography (a), and the differential tunnelling 
conductance (dI/dV) obtained as numerical derivatives of the measured -V 
curves (b and c). The ~70-mV shift of the Dirac point (curve minimum) 
from the Fermi level (zero bias) observed on graphene (inset to c) is due to the 
doping from the Au(111) substrate and the ambient atmosphere. 
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Extended Data Figure 3 | The effect of edge irregularities on edge of defect-free zigzag edges). The experimental width dependence can be fitted 


magnetism. Calculated spin density distribution in the unit cell of a 3.3-nm- _ for defective ribbon edges by using higher values of the on-site repulsion 
wide zigzag ribbon with a high density of atomic-scale defects, revealing the parameter of U= 4.32 eV. 
substantial decrease of the emerging spin polarization (to about a third of that 
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Extended Data Figure 4 | Individual tunnelling I- V spectra. Tunnelling J-V 
characteristics recorded on various ribbons (the spectra have been shifted 
along the vertical axis for clarity). Each individual spectrum was recorded as 
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the average of ten voltage sweeps between —500 mV and +500 mV. The 
metallic (close to linear) or semiconducting (strongly nonlinear) nature of the 
ribbons is clearly apparent from the individual tunnelling I-V characteristics. 
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Stochastic transport through carbon nanotubes in 
lipid bilayers and live cell membranes 


Jia Geng'?**, Kyunghoon Kim!*"**, Jianfei Zhang’, Artur Escalada®, Ramya Tunuguntla’”’, Luis R. Comolli®, Frances I. Allen”’*”, 
Anna V. Shnyrova®, Kang Rae Cho!, Dayannara Munoz’, Y. Morris Wang”, Costas P. Grigoropoulos*, 
Caroline M. Ajo-Franklin?:?", Vadim A. Frolov?” & Aleksandr Noy)? 


There is much interest in developing synthetic analogues of biolog- 
ical membrane channels’ with high efficiency and exquisite selectiv- 
ity for transporting ions and molecules. Bottom-up’ and top-down’ 
methods can produce nanopores of a size comparable to that of en- 
dogenous protein channels, but replicating their affinity and trans- 
port properties remains challenging. In principle, carbon nanotubes 
(CNTs) should be an ideal membrane channel platform: they exhibit 
excellent transport properties* * and their narrow hydrophobic inner 
pores mimic structural motifs typical of biological channels’. More- 
over, simulations predict that CNTs with a length comparable to the 
thickness of a lipid bilayer membrane can self-insert into the mem- 
brane”’®. Functionalized CNTs have indeed been found to penetrate 
lipid membranes and cell walls'’"”, and short tubes have been forced 
into membranes to create sensors’*, yet membrane transport appli- 
cations of short CNTs remain underexplored. Here we show that short 
CNTs spontaneously insert into lipid bilayers and live cell membranes 
to form channels that exhibit a unitary conductance of 70-100 pico- 
siemens under physiological conditions. Despite their structural sim- 
plicity, these “CNT porins’ transport water, protons, small ions and 
DNA, stochastically switch between metastable conductance substates, 
and display characteristic macromolecule-induced ionic current block- 
ades. We also show that local channel and membrane charges can 
control the conductance and ion selectivity of the CNT porins, there- 
by establishing these nanopores as a promising biomimetic platform 
for developing cell interfaces, studying transport in biological chan- 
nels, and creating stochastic sensors. 

We created short CNTs using sonication-assisted cutting’ of puri- 
fied long CNTs with an average inner diameter of 1.51 + 0.21 nm in 
the presence of 1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC) lipid 
(Fig. la). This procedure produced a stable dark suspension (Fig. 1b, 
inset) containing short CNTs stabilized by the lipid coating. Transmis- 
sion electron microscopy (TEM) images of uncut and cut CNTs are shown 
in Extended Data Fig. 1, and CNT length and diameter distributions are 
shown in Extended Data Figs 2 and 3a, b, respectively. Raman spectros- 
copy confirmed that the cutting procedure preserved the inner diameter 
and rolled-up graphene-sheet structure of the nanotube (Extended Data 
Fig. 3c). TEM images of the purified cut CNTs (Fig. 1b, Extended Data 
Fig. 1) also indicate the presence of a significant population of short 
CNT fragments with lengths in the 5-15 nm range (Fig. 1d, Extended 
Data Fig. 2). 

Cryogenic TEM (cryo-TEM) images of the CNTs reconstituted into 
200-nm-diameter liposomes (Methods) reveal short CNT fragments in- 
serted into the lipid membrane and spanning both membrane leaflets 
(Fig. 1c; see also Extended Data Fig. 4 for wider field-of-view cryo-TEM 


images, and Extended Data Fig. 5 for additional higher-magnification 
images). The presence of CNTs does not affect the integrity of the lipid 
bilayer, and statistical analysis of the cryo-TEM images reveals a rela- 
tively wide length distribution of the inserted tubes (Fig. 1d) that indicates 
that insertion is not selective to a particular length. The perpendicular 
orientation of the CNTs to the membrane plane is strongly preferred, 
with the majority of the nanotubes tilting only 15° or less from the mem- 
brane normal (Fig. le). This observation disagrees with molecular dy- 
namics simulations that predict tilting of CNTs longer than the thickness 
of the bilayer (4.6 + 0.2 nm for the DOPC membranes) to maximize the 
interactions of the hydrophobic bilayer core with the hydrophobic CNT 
walls’. Finally, for most inserted CNTs at least one tube end abutted the 
hydrophilic head groups of the lipid bilayer, allowing for energetically 
favourable interactions of the hydrophilic groups on the tube rim with 
the hydrophilic lipid headgroups. Because the short inserted CNTs have 
a graphitic structure and functionality resembling the B-barrel structure 
of porins, the water-filled transmembrane proteins that act as pores 
through which molecules can diffuse, we refer to them as CNT porins. 

We explored the proton transport behaviour of CNT porins by 
placing liposomes containing an encapsulated pH-sensitive dye (8- 
hydroxypyrene- 1,3,6-trisulphonic acid trisodium salt, HPTS) at pH 8.0 
into isosmotic buffer solution at pH 3.0. With liposomes lacking CNT 
porins, the background proton leakage through the vesicle membrane 
led to a slow HPTS fluorescence decay with a characteristic timescale 
of 187s (Fig. 1f). Conversely, samples that contained CNT porins in 
the bilayer showed much faster initial fluorescence decay with a 35s 
timescale, indicating the opening of a second pathway for protons to 
enter the vesicle lumen. The overall decay kinetics was described by a 
sum of two exponential terms, with the second term matching the 187 s 
timescale obtained in a control experiment (Fig. 1f, inset), indicating 
that a small subpopulation of the liposomes did not contain CNT porins. 

To explore transport through the CNT porins further, we placed lipo- 
somes containing pure water into dilute NaCl and K,SO, solutions 
and used dynamic light scattering to monitor liposome size changes re- 
sulting from osmotically driven water expulsion’*. Control liposomes 
without CNT porins showed minimal (<2.3%) size changes, whereas 
liposomes incorporating CNT porins shrank by up to 20% (Fig. 1g). CNT 
porin vesicle shrinkage depends strongly on the ionic strength of the 
outside solution, being highest at low ionic strength and reducing dra- 
matically at high ionic strengths to approach values measured for the 
control samples. This behaviour can be understood by considering that 
at pH 7.0, the carboxylic acid groups at the CNT porin entrance are 
negatively charged, creating an electrostatic barrier for the anion pas- 
sage; this barrier slows down the ion leakage into the liposome lumen 


1Biology and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, USA. School of Natural Sciences, University of 
California, Merced, California 95340, USA. >The Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA. “Mechanical Engineering Department, University of California, 
Berkeley, California 94720, USA. *Biophysics Unit (CSIC, UPV/EHU) and Department of Biochemistry and Molecular Biology, University of the Basque Country, 48940 Leioa, Spain. °Life Sciences Division, 
Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA. Department of Materials Science and Engineering, University of California, Berkeley, California 94720, USA. ®Materials Sciences 
Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA. ?National Center for Electron Microscopy, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA. 
10Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550, USA. !!Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 


94720, USA. Ikerbasque, Basque Foundation for Science, 48011 Bilbao, Spain. 
*These authors contributed equally to this work. 


612 | NATURE | VOL 514 | 30 OCTOBER 2014 


©2014 Macmillan Publishers Limited. All rights reserved 


e CNT tilt 
angle (deg) 
30 
75 
90 
0 5 10 15 20 2 4 6 


CNT length (nm) 


f 10 = g 
34 o 
; 1.04.33. 
g ee 
_ 08 8 & ~. . 
3 5 04 a - 3, 
& a C 0.94 . a 
® 06 2 fo.) . 
g” 8 5 . 
5 = 0.01 g , 
8 5 A 
2 500 1,000 2 % rae 
8 oA Time (s) S 9. ore 
8 2 : re 
oar — DOPC only 2 $O5-9- > 
; — DOPC+CNT 3 NaCl DOPC only 
=; © NaCl DOPC+CNT 
©KSO, DOPC+CNT 


4 5 678910 20 
Debye length (nm) 


200 400 600 800 
Time (s) 


1,000 1,200 2 3 


Figure 1 | Synthesis, characterization, and bulk transport properties of 
CNT porins. a, Schematic showing CNT porin preparation and incorporation 
into liposomes (see text for details). b, Left, TEM image of cut CNTs stabilized 
by lipid coating; boxed areas highlight cut CNTs; boxes labelled 1-4 are 
shown magnified at right. Inset, a photograph of cut CNT suspensions before 
and after centrifugation. c, Cryo-TEM image of a CNT porin. Inset, overlaid 
outlines of the bilayer (red) and CNT (blue). Image was processed to enhance 
contrast (see Extended Data Fig. 5 for unfiltered images of this and other 
CNT porins). d, Histogram of the lengths of the CNTs inserted into the lipid 
membrane and a Gaussian fit to the data (n = 20). e, Radius plot of the 
histogram of CNT tilt angles measured relative to the axis normal to the bilayer 
plane (n = 20). f, Time traces of the fluorescence of HPTS dye encapsulated 
in liposomes with (red trace) and without (blue trace) CNT porins after outside 
pH was lowered from pH 8 to pH 3. Dashed lines, single exponential fits; 
dash-dotted line (red trace), a sum of two exponential terms. a.u., arbitrary 
units. Inset shows the plot on a log-linear scale. g, Plot of the liposome size 
change D/Dp (where D stands for liposome diameter and Dp for the initial 
diameter) after exposure to NaCl (red circles and blue squares) and K,SO4 
(red diamonds) osmotic gradients. Dashed lines, fits to sigmoidal function. 
Error bars, s.d. (n = 4-5). Inset illustrates ion rejection by the carboxylic acid 
groups at the CNT porin entrance. 


but not water outflow, allowing the osmotic pressure to shrink the ves- 
icles (Fig. 1g, inset). At higher ionic strength these entrance charges are 
screened, allowing ions to pass through the CNT pore to neutralize the 
osmotic gradient. Indeed, the transition from high to low ion rejection 
in our measurements occurs at the same Debye length ranges (6-7 nm 
for NaCl and 4-5 nm for K,SO,) that we previously observed for bulk 
CNT membranes”, confirming that the observed effects are closely related 
to ion diffusion through the CNT pores. 

We observed spontaneous membrane incorporation and character- 
ized transport properties of individual CNT porins using single-channel 
conductance measurements (Fig. 2a). When we added CNT porins to 
the cis-chamber of the setup, the bilayer conductance showed a series 
of characteristic amplitude jumps (Fig. 2b), indicating membrane in- 
corporation events. Conductance traces contained a mixture of well- 
separated events and multiple steps corresponding to a relatively quick 
succession of incorporation events. The histogram of conductance step 
values (Fig. 2c) shows clear evidence of quantization, with the first peak 
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Figure 2 | Transport through individual CNT porins. a, Set-up for single- 
channel recording of CNT porin conductance where a small patch of lipid 
bilayer separates two solution chambers (labelled cis and trans). The 
transmembrane potential is set by the reference electrode (see Methods for 
details). b, Conductance traces showing incorporation of single (i, ii) and 
multiple (iii, iv) CNT porins into the lipid bilayer. Control trace was recorded in 
absence of CNTs. c, Histogram of CNT porin conductance values (n = 236). 
Solid blue line is a sum of two Gaussian peaks where the position and width 
of the second peak are determined from the position, 1, and width, a, of the 
first peak as 24 and \/2c. d, Conductance traces and normalized conductance 
histograms showing stochastic ‘gating’ transitions for one (i, ii) and two (iii) 
CNT porins. 


corresponding to the 0.63 + 0.12 nS value for a single CNT pore con- 
ductance anda subsequent peak at exactly double that value. Occasion- 
ally, we recorded conductance jumps at higher multiples of the single 
pore conductance value, but the frequency of those events was low. Re- 
markably, despite a relatively wide distribution of CNT porin lengths, 
their unitary conductance is well defined. Cryo-TEM images argue strongly 
against the possibility that only a narrow subset of short nanotubes in- 
corporates into the lipid membrane. Thus, we conclude that the con- 
ductance of CNT porins isa relatively weak function of the CNT length 
and instead is determined mainly by the interfacial resistances of the 
pore ends”. Interestingly, the 0.63 nS conductance ofa CNT porin is close 
to the 0.65-0.73 nS conductance of an o-haemolysin channel’’, which 
has a 10 nm-long pore with a width that varies from 1.4 nm in the neck 
region to 4.6 nm in the vestibule region. A much wider range of con- 
ductance values reported for acid-cut CNTs’’ could be attributed to the 
microinjection protocol used to insert these pores into the membrane. 

CNT porin conductance scaled linearly with salt concentration in the 
0.5-2 M range both at neutral pH and at pH 2 (Extended Data Fig. 6), 
strongly suggesting that the dominant conductance mechanism through 
the CNT porin is ionic. The unitary conductance of a CNT porin at pH 2 
was reduced by almost a factor of 2 compared to neutral pH, from 
0.63 nS to 0.33 nS (Extended Data Fig. 7b). This observation rules out 
proton conductivity as the dominant current-carrying mechanism in 
these pores and suggests that the conductance of the CNT porins is sen- 
sitive to the charges at the pore mouth (which are neutralized at pH 2). 
A similar decrease in channel conductance at acidic pH due to the changes 
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in the charge state of the pore residues has been reported for OmpF 
porins””. 

For less than 30% of the CNT porins, the conductance signal oscillated 
between ‘open’ and ‘partially-closed’ sub-states after insertion, showing 
distinct peaks in the histograms of the trace amplitude (Fig. 2d). Nota- 
bly, we did not observe these low conductance sub-states at pH 2 (Ex- 
tended Data Fig. 7a, c). Computer simulations of pore insertion into 
lipid bilayers indicate that rapid removal and re-insertion of the CNT 
porins from the bilayer cannot cause these oscillations’. The ionic nature 
of the conductance in CNT porins argues against the possibility that the 
oscillations originate from the ion blocking of the proton-based cur- 
rent as previously reported for macroscopically-long CNT channels°. 
Transient blocking by nanoprecipitation” requires the presence of di- 
valent ions and produces distinct triangular shaped blockades differ- 
ent from the ones we observed. Another possible cause, spontaneous 
wetting/dewetting transitions in hydrophobic nanopores”, should have 
produced a clear zero-conductance ‘off state; however, our system fre- 
quently shows only a partially-blocked low conductance sub-state (Ex- 
tended Data Fig. 8b, c), suggesting that the CNT interior remains filled 
with water’. 

Conductance oscillations in CNT porins are similar to the stochastic 
‘gating’ behaviour of biological ion channels. Unlike biological chan- 
nels, however, where conformation transitions are commonly invoked 
as a cause of conductance sub-states, CNT porins contain no movable 
parts and the entire pore structure is rigid. Similar transitions have been 
reported previously for ion-track-etched polyethylene terephthalate solid- 
state nanopores” and glass—polymer nanogaps”’. One possibility is that 
these sub-states represent a spontaneous transition between ionic- 
penetration and ionic-exclusion states, as predicted”* for weakly charged 
or neutral nanopores embedded in a dielectric medium. This model pre- 
dicts the existence of low-conductance sub-states for charged nanopores 
and on/off transitions for uncharged pores. Consistent with this pre- 
diction, the low conductance states disappear at pH 2. Although we find 
some parallels with the behaviour of conductance sub-states reported 
for OmpF porin channels”, further experimental work is necessary to 
understand the detailed nature of conductance fluctuations. 
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Figure 3 | DNA translocation through CNT ion channels. a, Schematic 
showing the translocation of single-stranded DNA through a CNT porin in the 
lipid bilayer (see text for details). Symbols in circles indicate ground electrode 
(top) and reference electrode (bottom). b, Current trace showing multiple 
transient blockades caused by 81-nt ssDNA translocation through the CNT 
channel (middle) with magnified view (bottom). The top trace shows the 
control trace recorded in absence of ssDNA. The applied voltage was —50 mV. 
c, d, Histograms of conductance blockade levels (c) and dwell times (d) for 
806 ssDNA translocation events. 
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When exploring electrophoretic translocation of macromolecules 
through individual CNT porins with 81-nucleotide-long single-stranded 
DNA (81-nt ssDNA) oligomers added to the trans-chamber of our set- 
up (Fig. 3a), we observed rapid conductance blockades as individual 
ssDNA strands passed through the CNT pore (Fig. 3b). The conduc- 
tance blockade values followed a normal distribution centred at 0.48 + 
0.08 nS (Fig. 3c), which falls between the 0.8 nS blockades reported for 
wild-type «-haemolysin pores” with a diameter similar to that of CNT 
porins and the 0.3 nS blockades reported for a hybrid o1-haemolysin/ 
solid-state nanopore structure*®. DNA translocation dwell times followed 
alog-normal distribution (Fig. 3d) centred around a value of 53 ms, which 
corresponds to an average DNA translocation speed of 1.5 nt per ms. This 
value is comparable to the speed achieved by biological nanopores and 
falls within the range suitable for DNA analysis applications”. 

Finally, we used patch-clamp measurements to demonstrate that CNT 
porins can spontaneously incorporate into the membranes of live Chi- 
nese hamster ovary (CHO) and human embryonic kidney (HEK293T) 
cell lines. For both CHO cells (in 5 out of 6 trials) and HEK293T cells 
(in 13 out of 15 trials), we detected channel-like activity emerging 60- 
90s after establishing a tight contact between a cell and patch-pipette 
containing CNT porins (Fig. 4a, see Extended Data Fig. 9 for examples 
of longer traces). In both cases the dominant activity corresponded to 
a unitary conductance of approximately 70 pS (68.1 + 0.8 pS for HEK 
293T cells and 67.6 + 1.4 pS for CHO cells) and its multiples (Fig. 4b, c, 
also see Extended Data Fig. 9). Control experiments performed with- 
out CNT porins showed no endogenous channel activity in CHO cells 
(10 out of 10 cells). HEK293T cells showed endogenous channel activ- 
ity with unitary conductance of 30.6 + 0.3 pS (Fig. 4b, Extended Data 
Fig. 9c, 5 out of 5 cells), which was also present as a 21.9 + 0.7 pS peak 
in the data obtained for interactions of HEK293T cells with CNT porins 
(Fig. 4b). Importantly, CNT porins incorporated into cell membranes 
display low-conductance sub-states (Extended Data Fig. 9e), comple- 
tely mimicking the conductance dynamics observed in planar lipid 
membranes. 

We detected similar single-channel activity using patch-clamp ex- 
periments on lipid bilayers, corroborating spontaneous incorporation 
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Figure 4 | CNT porin incorporation into live cell membranes. 

a, Representative conductance traces for CNT porin incorporation and gating 
in CHO (trace ii) and HEK293T (trace iv) cells (—60 mV holding potential, 
150 mM KCl)). Traces i and iii were recorded in the absence of CNT porins. 
Dashed lines are added as a guide to the eye. b, c, Histograms of conductance 
steps for CNT porin incorporation into HEK293T (b; blue bars, n = 2,467, 
13 cells) and CHO (c; red bars, n = 657, 5 cells) cell membranes fitted to a 
sum of Gaussian peaks (solid lines). See Methods for peak positions. Native 
channel activity for HEK293T cells (green bars and black solid line, n = 227, 
5 cells) is also seen with CNT porins. 
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of the CNT porins into the lipid membrane patch inside the patch- 
pipette. Notably, the unitary conductance detected in these experiments 
depended on the lipid composition (Extended Data Fig. 10a). In charged 
membranes containing the core ingredients of the plasma membrane, 
the measured conductance was 64.7 + 0.5 pS (at 150 mM KCl), closely 
corresponding to the dominant unitary conductance detected in cellular 
membranes. We observed similar conductances for CNT porins pre- 
reconstituted into giant unilamellar vesicles (Extended Data Fig. 10b, e), 
demonstrating that the activity of CNT porins does not depend on the 
membrane incorporation pathway. However, when we excluded charged 
lipid species from the bilayer composition, the unitary conductance 
shifted to 92.5 + 0.6 pS (at 150 mM KCl). This conductance level cor- 
responds to a conductance of 0.62 nS after correcting for the difference 
in electrolyte concentration (150 mM versus 1 M KC]), in excellent agree- 
ment with the value from the planar lipid bilayer experiments (Fig. 2c). 
The origins of the modulation in conductance caused by the membrane 
surface charge need to be clarified by follow-up studies. 

Taken together, our findings establish CNT porins as a promising 
prototype of a synthetic membrane channel with inherent robustness 
towards biological and chemical challenges and exceptional biocompat- 
ibility that should prove valuable for bionanofluidic and cellular inter- 
face applications. We expect that our CNT porins could be modified 
with synthetic ‘gates’”* to dramatically alter their selectivity, opening up 
possibilities for their use in synthetic cells, drug delivery, and biosensing. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Materials. 1.5 nm diameter single-walled carbon nanotubes were purchased from 
NanoLab, Inc. (Waltham, Massachusetts, USA, product purity >95%). 1,2-dioleoyl- 
sn-glycerol-3-phosphocholine (DOPC), 1,2-dioleoyl-sn-glycero-3-phosphoetha- 
nolamine (DOPE), 1,2-dioleoyl-sn-glycero-3-phospho-L-serine (DOPS), choles- 
terol (Chol), 1,2-dioleoyl-sn-glycero-3-phosphoinositol-4,5-bisphosphate (PI(4,5)P2) 
and 1,2-dioleoyl-sn-glycero-3-phosphoethanolamine-N-(Lissamine Rhodamine B 
sulphonyl) (Rh-DOPE) were purchased from Avanti Polar Lipids, Inc. (Alabaster, 
Alabama, USA). N-decane was purchased from Fisher Scientific. 8- Hydroxypyrene- 
1,3,6-trisulphonic acid trisodium salt (HPTS), hexane, and gramicidin A from Ba- 
cillus brevis were purchased from Sigma-Aldrich (St Louis, Missouri, USA) and 
Invitrogen (Carlsbad, California, USA). Short single-stranded DNA oligonucleo- 
tides (81-nt) were purchased from Integrated DNA Technologies, Inc. (Coralville, 
Iowa, USA). 

CNT cutting. To remove amorphous carbon and impurities from the CNT sam- 
ple, 1.6 to 2 mg of as-received CNTs were first purified in a thermal gravimetric anal- 
ysis (TGA) system (Q5000 IR TGA-MS, TA Instruments, New Castle, Delaware, 
USA) in 25 ml min“! stream of air while the temperature was ramped at a rate of 
5°Cmin | from 20°C to 450 °C. Ina typical CNT porin preparation procedure, 
2 ml of DOPC (10 mg ml * in chloroform) was added to a 20 ml glass vial and the 
solvent was evaporated using a V10 evaporator (Biotage, Uppsala, Sweden) to form 
a thin lipid film on the walls of the glass vial. 1.6 mg of purified CNTs and 20 ml DI 
water (Milli-Q, Millipore, Boston, Massachusetts, USA) were added to the same 
vial. Following 1 h of bath-sonication (Model 1510, Branson Ultrasonics, Danbury, 
Connecticut, USA), the sample was probe-sonicated for 16 h (Model VC 100, Sonics 
& Materials Inc., Newtown, Connecticut, USA) at 100 W power delivered in 3 s pulses 
separated bya 1 s pause. During sonication, the vial was continuously cooled in an 
ice-water bath and, ifnecessary, DI water was added to compensate for loss in sam- 
ple volume due to evaporation. 

Purification and incorporation of CNT porins into liposomes. To separate the 
cut CNTs from the uncut nanotube material, 10 ml of suspension processed by son- 
ication cutting was added to a 15 ml centrifuge tube and centrifuged at 4,000 r-p.m. 
for 1h using an Allegra X22 centrifuge (Beckman-Coulter, Indianapolis, Indiana, 
USA). The dark supernatant solution containing lipid-stabilized cut CNTs was 
stable and could be dried and re-suspended in DI water, suggesting that the lipid 
coating helped to prevent the aggregation of the cut CNTs. 

To incorporate CNTs into DOPC liposomes, a 0.2 ml aliquot of 10 mg ml’ DOPC 
in chloroform was added to a 20 ml glass vial and the solvent was evaporated to leave 
a dried lipid film. 1 ml of purified CNT/DOPC complex solution was added to the 
vial and the mixture was allowed to hydrate for one hour. The solution was then 
extruded through a 200-nm-diameter pore polycarbonate membrane using a hand- 
held Mini-Extruder device (Avanti Polar Lipids) for 10 cycles. 

Cryo-TEM specimen preparation and instrumentation. 5 ul aliquots of lipo- 
somes incorporating CNT porins (as well as vesicle-only control samples) were placed 
onto 200 mesh carbon-stabilized lacey-Formvar copper grids (#01881,Ted Pella, 
Inc., Redding, California, USA), manually blotted with filter paper, and plunged 
into liquid ethane at liquid nitrogen temperature. For control studies of isolated cut 
and uncut CNTs with and without lipids, 5 ,1l aliquots were placed on 200 mesh 
carbon-stabilized Formvar-film copper grids (#01800, Ted Pella, Inc.), blotted with 
filter paper and then air-dried. All TEM grids were glow discharged. Samples were 
transferred to a JEOL-3100-FFC transmission electron microscope equipped with 
a field emission gun (FEG) operating at 300 kV, an Omega energy filter, a Gatan 
795 4K X 4K CCD camera (Gatan Inc., Pleasanton, California, USA) mounted at 
the exit ofan electron decelerator operated at 200 kV resulting in images formed by 
a 100kV electron beam at the CCD, and a cryo-transfer stage. The stage was cooled 
with liquid nitrogen to 80 K during the acquisition of all data sets. 

Cryo-TEM image acquisition and analysis. Images were recorded at a nominal 
magnification of 40,000 giving a pixel size of 0.14nm at the specimen, either 
without camera binning or using a camera binning factor of two. Underfocus values 
ranged from 2 + 0.25 tum to 3.6 + 0.25 jm, and energy filter widths were typically 
around 30 eV. The surveys of cryo samples and the selection of suitable targets were 
performed in low-dose defocused diffraction mode. Images were acquired under 
low-dose conditions, with doses of ~20-40e A~2 per image. For the control sam- 
ples of cut and uncut CNTs with and without lipids, images were acquired at 
80,000 giving a pixel size of 0.07 nm at the specimen. Several dozen images were 
acquired with defocus values ranging from minimum contrast to 2 jum underfocus. 
Through-focus series were also acquired. Images acquired at 2 jm underfocus with 
camera binning factor of two, giving a pixel size of 0.14 nm, agree well with the images 
obtained for the cryo vesicle-CNT samples acquired at the same defocus. The pack- 
age of imaging tools Priism (http://msg.ucsf.edu/IVE/) was used for noise reduction 
and contrast enhancement. Image] 1.38x software (NIH, http://rsb.info.nih.gov/ij/) 
was used for analysis and measurements of the 2D image projections. Since the 
images represent a 2D projection of a 3D structure, the measured CNT length 


values represent the lower boundary estimate of the true CNT lengths. To char- 
acterize the length- and tilt-angle-distribution (Fig. 1d and e, respectively) of short 
CNTs before and after insertion into lipid bilayers, we manually measured the length 
of identifiable individual CNTs from a series of high-resolution TEM images (Fig. 1c, 
Extended Data Figs 1 and 2a). Only CNTs with relatively high signal-to-noise were 
selected for analysis. 
Conventional TEM specimen preparation and imaging. Control samples of iso- 
lated cut and uncut CNTs were also investigated (Extended Data Figs 1a, b and 2b, c) 
with a room-temperature stage using a Philips CM300 TEM, operated at 300 kV with 
the field-emission gun extraction voltage set at 4.2 kV. Specimens were prepared by 
first dispersing the CNTs in ethanol using 30 min of bath sonication, and then drop- 
casting onto lacey-carbon TEM copper grids. The average diameter for the uncut 
CNTs was 1.51 + 0.21 nm, as measured over a dozen individual tubes. 
Atomic force microscopy (AFM) imaging. AFM imaging was performed using a 
Multimode Nanoscope IIIA, Nanoscope VIII AFM (Digital Instruments, Santa 
Barbara, California, USA), and Asylum Research MFP-3D (Asylum Research, Santa 
Barbara, California, USA), both operating in tapping mode. For imaging in air, we 
used silicon tips (PPP-FMR, Nanosensors GMBH, Germany) with nominal probe 
radius of less than 7 nm, force constant of 2.8 Nm‘, and resonance frequency of 
75 kHz. 10 pl of purified CNT solution was deposited onto freshly cleaved mica pre- 
treated with poly-L-lysine solution (Ted Pella) to facilitate adsorption of negatively- 
charged CNTs, allowed to incubate for 2 min, rinsed with DI water, and dried with 
nitrogen. AFM images typically represent a true height (diameter) value of the CNTs, 
and the average CNT height observed in the AFM images, 1.54 + 0.44 nm (Extended 
Data Fig. 3a), matches the CNT diameter value determined from the cryo- TEM 
and Raman measurements (Extended Data Figs 3b and c, respectively). 
Raman spectroscopy. A droplet of CNT/DOPC suspension was placed onto a glass 
cover slip, dried and placed into a Renishaw micro-Raman spectrometer (Renishaw, 
Hoffman Estates, Illinois, USA) using 488 nm laser illumination. The Raman spec- 
trum for short CNTs showed the typical CNT bands at ~1,600 cm’ (G band) and 
~1,300cm | (D band), and show a G/D ratio of 8 (Extended Data Fig. 3c), indi- 
cating that the CNT walls maintain a graphitic structure after sonication cutting. 
Proton transport assay. To prepare the CNT-containing liposomes loaded with a 
pH-sensitive fluorescent dye, a 0.1 ml aliquot of 10 mg ml‘ DOPC in chloroform 
was added to a 20 ml glass vial, and dried into a film by evaporating the solvent. 
1 ml of CNT porin (1 mg ml ') solution in 1 mM HPTS and 150 mM KCl (titrated 
to pH 8), were added to the dried lipid film, allowed to rehydrate for 1 h and then 
extruded through a 200 nm diameter pore polycarbonate membrane. For prepara- 
tion of a control sample, 0.2 ml aliquot of 10 mg ml” ' DOPC in chloroform was 
dried to a film, rehydrated with 1 ml of 1 mM HPTS and 150 mM KCI (pH = 8), 
and extruded the same way. Unencapsulated external dye was removed using a 
size exclusion chromatography (SEC) column packed with Sepharose CL-6B resin 
(Sigma-Aldrich). After we inserted the CNT porins into the liposome walls we did 
not observe any leakage of the HPTS dye from the liposomes, indicating that the 
CNT porins were impermeable to HPTS. All solutions used for these experiments 
contained a 150 mM isosmotic concentration of background KCl electrolyte to 
ensure the absence of osmotic pressure gradients. 

HPTS dye emission (excitation 450 nm, emission 514 nm) was monitored using 
a FluoroMax-4 fluorometer (Horiba Inc. Albany, New York, USA). For kinetic 
measurements, 0.2 ml CNT-liposome sample (or 0.2 ml control liposome sample) 
was added to the cuvette containing 2 ml of 150 mM KCl (pH = 3). Solution was 
continuously stirred with a magnetic stir bar during measurements. 
Osmotic-pressure-induced liposome size change measurement. 0.2 ml of 
10mgml * DOPC solution in chloroform was added to a vial and dried to a film 
using a Biotage V-10 evaporator. 1 ml of CNT porin solution was added to the vial 
and extruded through polycarbonate filter membrane with 200 nm diameter pores 
using a hand-held extrusion device (mini-extruder, Avanti Polar Lipids). A control 
sample was prepared in a similar way, except that 0.3 ml of the DOPC solution was 
used to prepare a lipid film and water was added instead of CNT porin solution. 

Hydrodynamic radius of the liposomes was measured using a dynamic light scat- 
tering set-up (Zetasizer, Malvern Instruments, Malvern, UK). Ina typical measure- 
ment, we mixed 10 1] of the liposome solution with 70 pl NaCl solution of known 
concentration in a low-volume cuvette cell (Malvern Instruments). 
Single-channel recordings of CNT porin conductance. A planar lipid bilayer was 
formed over a 200 j1m diameter aperture in a Teflon film partition installed in a 
two-chamber BLM cell (Eastern Scientific LLC, Rockville, Maryland, USA) using 
a painting technique”’. The trans-chamber (connected to a reference Ag/AgCl elec- 
trode) and cis-chamber (connected to a ground electrode Ag/AgCl) were filled with 
2 mland 0.2 ml conducting buffer (KCl or NaCl with 5 mM HEPES, pH 7.4) respec- 
tively. During the measurement a 2 1] solution of DOPC liposomes containing CNT 
porins or the stock CNT porin solution was added to the cis-chamber. 

A holding potential between —200 mV to 200 mV was applied to the reference 
electrode, and the trans-membrane current signal was recorded by a patch clamp 
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set-up, which consisted of an Axopatch 200B patch clamp amplifier and an Axon 
DigiData 1322A analogue-digital converter (Axon Instruments, Milpitas, Califor- 
nia, USA) connected to a computer system running Clampex 10.3 software (Axon 
Instruments). Traces were acquired at a sampling frequency of 10-100 kHz and 
were low band-pass filtered at a frequency of 5 kHz or 2 kHz to increase signal-to- 
noise ratio. The data were analysed and exported using PClamp 10.3 software (Axon 
Instruments), and further analysed using Origin Pro 8.0 and Igor Pro 6.31. 

As a positive control, the conductance of gramicidin channels was measured 

using the following procedure: 0.1 mg gramicidin A was incubated with 200 pl of 
the DOPC liposome solution (150 mM KCl, 4 mg ml ‘lipid concentration, pH 7.4). 
After a lipid bilayer was formed over the testing set-up aperture (in the presence of 
1M KCl solution in both chambers), 2 pl of gramicidin A/liposome solution was 
added into the cis-chamber and the conductance traces were recorded using a trans- 
membrane potential of 75 mV. The conductance measured for gramicidin was 
0.058 + 0.008 n$, which is close to the reported values of 0.0459 + 0.005 pS*. 
DNA translocation measurements. We used a similar set-up and sample prepa- 
ration technique to that described in the preceding section. ssDNA oligonucleotide 
(5'-/5Phos/GCG GCC GCT ACT AGT CTT ACC GCC ACC CAG AGG GCC 
ATA ACG GGT ACG GTA TTG GCT TAC ACG GTT ACG CAG ACG CTG 
TAC-3’, 81 nt length) was pre-mixed into the solution (1M KCl, pH 7.4) in the 
chambers to a final concentration of 100 pM. After a planar lipid bilayer was formed, 
2 pl of CNT-containing liposomes was added into the solution in the cis-chamber 
by pipette. A holding or ramping potential of 50 mV maximum amplitude was 
applied across the bilayer. For the control experiment no ssDNA was added to the 
chambers. Translocation events from current traces of a total of 338s duration 
were counted either manually or using custom-written software in Igor Pro (Wave- 
metrics, Inc., Lake Oswego, Oregon, USA). 
Cell culture and on cell patch-clamp measurements. CHO and HEK 293T cells 
were cultured according to the prescribed protocol (ATCC website), at 37 °C, 5% 
CO,, in 25cm? flasks, in DMEM (HyClone SH30243.01 High Glucose) supple- 
mented with gentamicine antibiotic (5 mg1™ 1) and 10% serum (DE14-801E, Lonza, 
Allendale, New Jersey, USA). These cells are known for low endogenous channel 
activity and, consequently, widely used for heterologous expression of ion channels. 
24h before the experiment, the cells were plated in 35 mm Petri dishes (with plastic 
bottom observation window, 35mm low u-Dish cat#80136, Ibidi, Martinsried, 
Germany) using PBS without Ca**/Mg”* (D8537, Sigma, St Louis, Missouri, USA). 
For the patch-clamp experiment the culturing media was substituted with external 
solution (PBS with Ca’*/Mg"*, Sigma, D8662), and the experiments were con- 
ducted within the next 45 min, at room temperature. 

Patch-pipettes were produced from borosilicate glass (GB150-10, Science Pro- 
ducts, Hofheim, Germany) using a Sutter P-2000 puller (Sutter Instruments, Novato, 
California, USA) and filled with the solution containing (in mM): 150 KCl, 10 
HEPES, and 1 EDTA pH 7.0; the pipette resistance was 3-5 MQ. CNT porin solu- 
tion was added to the pipette solution before each experiment. We used 0.15% 
dilution of the CNT porin stock solution in the pipette buffer, a higher amount of 
the CNT interfered with the gigaseal formation. The data were obtained and pre- 
processed using an EPC10 patch-clamp amplifier (HEKA Electronik, Lambrecht/ 
Pfalz, Germany), at 5 kHz sampling rate with the 2.9 kHz internal filter of the EPC10. 
At the beginning of each record we applied a series of holding potential steps (from 
—60 mV to +60 mV) to estimate the reversal potential. For data acquisition, we used 
cells with approximately symmetric current response (Extended Data Fig. 10e), 
using —30 mV and —60 mV holding potentials which induced minimal endogen- 
ous channel activity. Stable conductance levels were detected using FITMASTER 
software (HEKA). The background (reference) conductance was determined from 
an all-point histogram. The Gaussian fits to the conductance histograms for 
HEK293T cells (Fig. 4b, blue bars, red solid line) give the peak values of 68.1 + 
0.8, 122.7 + 1.0, 177.8 + 2.1, 223.4 + 2.7 pS. For CHO cells (Fig. 4c, red bars, blue 
solid line) the peak values were 67.6 + 1.4, 130.8 + 1.3, 195.3 + 1.1, and 255.3 + 
10 pS. 
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Patch-clamp measurements on planar lipid bilayer and giant unilamellar 
vesicles. Planar lipid bilayers were prepared using a similar methodology as above, 
based on the modified Montal-Mueller ‘painting’ procedure described elsewhere*’. 
The bilayers were produced from 20g1~’ solution of lipids in squalane; charged 
membranes contained (in mol%): 28% DOPC, 25% DOPE, 30% Chol, 15% DOPS, 
and 2% PI(4,5)P2; non-charged membranes contained (in mol%) 50% DOPC, 20% 
DOPE, and 30% Chol. Membranes were formed on 300x mesh copper TEM grids 
(Pelco, 1GC300) mounted on the bottom of the Petri dish using adhesive spacers 
(Grace BioLabs Inc., Oregon, USA) as described previously*’. Grids were pretreated 
with the appropriate combination of lipids dissolved in 1:1 mol/mol octane:decane, 
20 gl” (ref. 31). The bilayers were formed in the working Buffer 1, containing (in 
mM): 150 KCl, 10 mM Hepes, and 1 mM EDTA, pH 7. 

Giant unilamellar vesicles (GUVs) were produced by spontaneous swelling of 
lipid films deposited on silica beads. The ‘charged membrane’ composition was 
used, supplemented with 0.5 mol% of Rh-DOPE for fluorescence microscopy de- 
tection of GUVs. The GUV preparation was modified to allow for the CNT incor- 
poration into lipid films before GUV formation. Briefly, the initial lipid film (total 
0.05 mg of the appropriate lipid mixture in chloroform/methanol (9:1 vol/vol) mix- 
ture) was formed in a round bottom flask using a rotary evaporator. 1 ml of CNT 
stock solution was added and mixed thoroughly to resuspend the lipid film. The 
mixture was dried in vacuum and then 50 ul of chloroform was added and rapidly 
dried in a rotary evaporator in order to form a homogeneous lipid-CNT film. 100 pl 
of distilled water was added for a 30 min pre-hydration of the film at 50 °C. The film 
was vortexed and the resulting mixture was added to solution containing 40 um 
plain silica beads (Corpuscular Inc., Cold Spring, New York, USA): a small (2.5 pl) 
drop of CNT/lipid mixture was added to a 0.5 pl drop of the bead solution (10% 
vol/vol) deposited on Teflon film. The drops were dried in a vacuum chamber drier 
(15 min, room temperature). CNT-GUVs formation was detected within 10 min 
after the addition of 300 mM sucrose buffer at room temperature. 

CNT-GUVs remained attached to the beads transferred to the observation cham- 
ber (with a glass-bottom observation window pretreated with bovine serum albu- 
min as described previously)” containing Buffer 1. The chamber was mounted in 
an inverted microscope equipped with an Andor iXon+ camera (Andor Technology, 
Belfast, UK) operating with the ImageJ-sManager open source software*’ and a 
LUCPFELF 40x/0.60NA objective. 550/590 nm excitation/emission wavelengths were 
used. Conductance measurements were performed 30 min after the GUV transfer 
to Buffer 1 to allow for the equilibration between the GUVs’ interior solution and 
the buffer. 

For patch-clamp experiments on planar bilayers and GUVs we used borosilicate 
glass pipettes (Science Products, GB150-10) filled with Buffer 1. The pipette resis- 
tance was 3-5 MQ and 0.15% to 0.30% of the CNT porin solution was added to the 
pipette solution. Experiments were conducted at room temperature. The pipette 
approach to the bilayer and the formation of a gigaseal was performed as described 
earlier’. The patch conductance was measured at +50 and —50mV holding 
potentials using an Axopatch 200B (Axon Instruments) amplifier and a PC-44 ac- 
quisition board (Signallogic). The sampling frequency f was 1 kHzand signals were 
passed through 8-pole Bessel filters (Frequency Devices) set at f/2 corner frequency. 
The data were processed using FITMASTER software (HEKA). 
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Extended Data Figure 1 | TEM images of isolated CNTs. a-c, Purified CNTs _ with lipid and lightly sonicated to aid nanotube dispersion. h-m, Isolated CNT 
before cutting (a and b, conventional TEM, c imaged on a cryo stage; all porins after the cutting procedure. 
air-dried samples). d, Magnified image of one CNT from c. e-g, CNTs mixed 
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Extended Data Figure 2 | Length distributions of CNT porins measured 
from TEM images. a, CNT porin sample after cutting and purification 
procedures (n = 65). b, Cut and purified sample after passing through a 100 nm 
pore filter membrane (n = 36). ¢, As b but after passing through a 50nm 
pore filter membrane (n = 156). 1 indicates the mean CNT porin length. 
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Extended Data Figure 3 | Characterization of CNT porins. a, Histogram 
of the diameters of short CNT fragments measured with AFM. Most CNT 
porins show diameters between 1 and 2 nm. Inset, high-magnification AFM 
image of a single CNT on a bare mica surface. b, Histogram of the diameters of 
CNT porins inserted into the lipid vesicles measured by cryo-TEM. c, Raman 
spectrum of the short-CNT/lipid complex after 16 h of sonication-assisted 
cutting. Inset, magnified view of the radial breathing mode region of the CNT 
spectrum (100-300 cm '). 
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Extended Data Figure 4 | Cryo-TEM images of CNT-liposome complexes. Original large field-of-view images (main panels); insets, magnified views of the 
selected structures, with false-colour lines added to highlight the lipid bilayer (red) and CNT porins (blue). 
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Extended Data Figure 5 | High magnification cryo-TEM imaged of CNT-liposome complexes. Panels (left to right) show unprocessed, image-processed, and 
colourized unprocessed examples. See Methods for details of image processing. 
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Extended Data Figure 6 | Single CNT porin conductance at different the data (dashed lines) correspond to 0.66 nS M_| at pH 7 and 0.36 nS M_| at 
electrolyte concentrations. Plots of the average single CNT porin pH 2. Error bars, s.d.; 1 values as follows (pH = 7: 0.15 M, n = 6; 0.5M, 


conductance at different values of background electrolyte (KCI) concentrations n= 161;1M,n=39;2M,n=8. pH =2:0.5M, n= 18; 1M,n=3;2M, 
at pH 7 (red circles) and pH 2 (blue squares). The slopes of the linear fitthrough —_ = 36). 
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Extended Data Figure 7 | Single CNT porin conductance at pH 2. 

a, Representative conductance traces showing individual CNT porin 
incorporation at pH 2 and 2 M KCI background electrolyte concentration. 
Traces i-iv represent incorporation from 4 independent experiments. 

b, Histogram of conductance values measured for 109 individual CNT channel 
incorporation events (n = 109). Solid blue line corresponds to the fit of the data 
to a distribution expected for multiple channel incorporation, where the 
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position and width of the second peak are determined from the position, 
(0.36 + 0.006 nS), and width, o (0.10 + 0.008 nS), of the first peak as 2 and 
V2c. c, Representative conductance traces showing individual CNT porin 
incorporation at pH 2 and 0.5 M KCI background electrolyte. Traces i-iv 
represent incorporation from 4 independent experiments. Control trace 
shows a conductance trace collected at the same conditions without adding 
CNT porins. 
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Extended Data Figure 8 | Representative traces and corresponding current 
level histograms of CNT porin incorporation, gating and DNA 
translocation in lipid bilayers at pH 7. a, Control conductance trace recorded 
in the absence of CNT porin. Although only the first 5 min are shown, the 
current level remained stable for up to 80 min (n = 3 X 10°). b, c, Conductance 
traces and histograms showing single CNT porin incorporation events and 
subsequent current fluctuations between gating sub-levels (n > 5 X 10°). 
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d, Trace and conductance value histogram showing CNT porin incorporation 
event and subsequent two DNA translocation events (indicated by black 
arrows). Here n = 3.6 X 10°. The peaks on the histogram correspond to 0.01 nS 
for pre-incorporation baseline, 0.22 nS for DNA blockade, and 0.67 nS for 
open pore conductance, which gives the DNA blockade of 0.45 nS and a 
blockade ratio of 67%. Dashed lines are added to all traces as a guide to the eye. 
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Extended Data Figure 9 | CNT porin conductance in cellular membranes. _porin incorporated into the plasma membrane of CHO cell, holding potential 


a, Schematic of the on-cell patch clamp measurement. b, c, On-cell —60 mV. f, Mean patch conductance (on cell configuration, 0.15% CNT 
conductance measurements demonstrate no channel activity (b) and stock solution in the patch pipette) detected in CHO and HEK293T cells at 
appearance of a small flickering channel (c) in control experiments with positive and negative holding potentials. Error bar, s.d.; 4 cells). In ~70% of 


HEK293T cells. Holding potentials were —60 mV (b) and +60 mV (c),thetime HEK293T cells and 95% of CHO cells we did not observe significant 
order is from left to right, top to bottom. d, In the presence of CNT porin (0.15% | dependence of the conductance on the holding potential; only those cells 
dilution of the stock), large channel activity is detected, holding potential were used for the conductance analysis. 

—60 mV. e, Metastable conductance sub-states of 50 pS detected with CNT 
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Extended Data Figure 10 | CNT porin conductance in patches isolated from and 92.5 + 0.6 pS (uncharged bilayer). b, Histogram of conductance values 
planar lipid bilayers and giant unilamellar vesicles. a, Histogram of measured for CNT porins reconstituted into giant unilamellar vesicles (GUVs) 
conductance values measured for CNT porin incorporation into a planar lipid mimicking the composition of charged plasma membrane (red bars, n = 40, 3 
bilayer mimicking the composition of a charged plasma membrane (red bars, | membranes), Solid lines correspond to the fit of the data to a distribution 


n = 681, 5 membranes, 3 controls without CNT porins showed no events) expected for multiple channel incorporation with the primary conductance 
and a similar lipid bilayer without charged species (black bars, n = 540, peak at 60 + 2 pS. c, Representative control experiment demonstrating lack 
5 membranes, 3 controls without CNT porins showed no events). The of spontaneous pore formation in the bilayer patch. d, e, Patch-clamp 
background electrolyte concentration was kept at 150 mM. Solid lines measurements of the CNT porin incorporation events in planar lipid 


correspond to the fit of the data to a distribution expected for multiple channel _ bilayer (d) and GUV membrane (e). 
incorporation. The primary peak positions are 65.1 + 0.4 pS (charged bilayer) 
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Centennial-scale changes in the global carbon cycle 
during the last deglaciation 


Shaun A. Marcott’?, Thomas K. Bauska!, Christo Buizert', Eric J. Steig?, Julia L. Rosen!, Kurt M. Cuffey’*, Ted. Fudge’®, 
Jeffery P. Severinghaus’, Jinho Ahn°, Michael L. Kalk', J oseph R. McConnell’, Todd Sowers®, Kendrick C. Taylor’, 


James W. C. White? & Edward J. Brook! 


Global climate and the concentration of atmospheric carbon diox- 
ide (CO,) are correlated over recent glacial cycles'”. The combination 
of processes responsible for a rise in atmospheric CO, at the last 
glacial termination’* (23,000 to 9,000 years ago), however, remains 
uncertain’ °. Establishing the timing and rate of CO, changes in the 
past provides critical insight into the mechanisms that influence the 
carbon cycle and helps put present and future anthropogenic emis- 
sions in context. Here we present CO, and methane (CH,) records of 
the last deglaciation from a new high-accumulation West Antarctic 
ice core with unprecedented temporal resolution and precise chro- 
nology. We show that although low-frequency CO, variations parallel 
changes in Antarctic temperature, abrupt CO2 changes occur that have 
aclear relationship with abrupt climate changes in the Northern Hemi- 
sphere. A significant proportion of the direct radiative forcing assoc- 
iated with the rise in atmospheric CO occurred in three sudden steps, 
each of 10 to 15 parts per million. Every step took place in less than 
two centuries and was followed by no notable change in atmospheric 
CO, for about 1,000 to 1,500 years. Slow, millennial-scale ventilation 
of Southern Ocean CO,-rich, deep-ocean water masses is thought to 
have been fundamental to the rise in atmospheric CO, associated 
with the glacial termination’, given the strong covariance of CO, levels 
and Antarctic temperatures’. Our data establish a contribution from 
an abrupt, centennial-scale mode of CO variability that is not directly 
related to Antarctic temperature. We suggest that processes operating 
on centennial timescales, probably involving the Atlantic meridional 
overturning circulation, seem to be influencing global carbon-cycle 
dynamics and are at present not widely considered in Earth system 
models. 

Ice cores from Greenland®” provide unique records of rapid climate 
events of the past 120 kyr. However, because of relatively high concen- 
trations of impurities*, Greenlandic ice cores do not provide reliable 
atmospheric CO, records. Antarctic ice cores, which contain an order- 
of-magnitude fewer impurities, provide reliable records®, but existing 
data from the deglacial period from Antarctica either provide only low 
temporal resolution’ or are of relatively low precision”. 

The West Antarctic Ice Sheet Divide ice core (WDC) (79.467° S, 
112.085° W, 1,766 m above sea level) was drilled to a depth of 3,405 m 
in 2011 and spans the past ~68 kyr. At present, the site has a mean annual 
snow accumulation of 22 cm ice equivalent per year and a surface tem- 
perature of —30 °C. Annual layer counting to 2,800 m depth (~30 kyr 
ago) provides a very accurate timescale for comparison with data from 
other archives’. The difference in age (Aage) between the ice and the 
gas trapped within it, which is critical for developing a gas-age chrono- 
logy, is 205 + 10 yr at present and was 525 + 100 yr at the last glacial 
maximum (LGM) (Extended Data Fig. 1). Given the high accumulation 
at the site, minimal smoothing due to gas transport and gradual occlu- 
sion, and precise chronological constraints, WDC is the best Antarctic 


analogue to central Greenlandic deep ice cores, with a substantially 
better-dated gas chronology during the glacial period, and is able to 
resolve atmospheric CO, at sub-centennial resolution. 

We exploit the unique aspects of WDC to reconstruct atmospheric 
CO, and CH, concentrations at high resolution over the last deglacia- 
tion. The atmospheric CH, record is important in this context because 
CHy varies in phase with rapid Greenlandic climate changes during the 
deglaciation and therefore provides a means of synchronizing our Ant- 
arctic record with Northern Hemisphere climate’*. Atmospheric CH, 
is primarily produced by land-based sources, principally tropical and 
boreal wetlands’* where emissions are driven by changes in temperature 
and hydrology, and CH, therefore serves as an indicator of processes in 
the terrestrial biosphere. 

The WDC deglacial CO, record is characterized by a long-term 
~80 p.p.m. increase, similar to other cores”"*, which begins at 18.1 kyr 
ago, occurs in several discrete steps throughout the deglacial transition, 
and ends 7 kyr later, during the early Holocene epoch. Given the precise 
WDC chronology, the phasing of CO, changes with respect to Antarctic 
and global temperature’ can be more precisely evaluated. The timing of 
the initial deglacial CO, rise leads the initial rise in global temperature” 
(17.2 kyr ago) by several centuries, and continues to lead across the entire 
termination (Extended Data Fig. 7). The timing of the initial CO, rise is 
consistent with the newest gas chronology of EPICA Dome C* (EDC), 
but more highly resolved and precise given the smaller gas-age uncer- 
tainty in WDC than in EDC (Extended Data 4). The WDC methane 
record shows in detail the abrupt changes at the onset of the Bolling- 
Allered and Younger Dryas stadials and the start of the Holocene. We 
observe a smaller, but prominent, methane excursion at 16.3 kyr ago 
not previously reported from other records (Fig. 1). Our record also 
resolves the beginning of the deglacial methane rise at 17.8 kyr ago 
(Extended Data Table 1). 

The WDC CO, record demonstrates that CO, varied in three dis- 
tinct modes during the deglaciation. The first mode is relatively grad- 
ual change (~10 p.p.m. kyr '): such changes in CO, began at 18.1 and 
13.0 kyr ago and were broadly coincident with a reduction in the strength 
of the Atlantic meridional overturning circulation (AMOC; Fig. 2i), a 
cold North Atlantic” and warming in the Southern Hemisphere’. The 
second mode is rapid increase: 10-15 p.p.m. increases in CO) occurred 
in three short (100-200 yr) intervals at 16.3, 14.8 and 11.7 kyr ago, the 
latter two at times of rapid resumption of the AMOC and warming in 
the Northern Hemisphere. The rapid changes at 14.8 and 11.7 kyr ago 
were first noted at EDC”, but the magnitude, duration and timing are 
now more fully resolved because of the unique site conditions at WDC. 
The third mode is no change in atmospheric CO}. These apparent pla- 
teaus in the WDC CO, record lasted for 1,000-1,500 yr and occurred 
directly after the rapid, century-scale increases. 
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Figure 1 | Greenhouse gas and stable water isotope measurements from 
Antarctica and Greenland. a, Oxygen isotopes from WDC" (grey; black line is 
11-point weighted average) and water-isotope-derived temperature composite 
from East Antarctica’ (green). 5480 = (80/ *°O) sampte/(O/ O)vsmow - 13 
VSMOW, Vienna Standard Mean Ocean Water. b, Atmospheric CO, 
concentrations (this study; black line is 5-point weighted average). c, Direct 
radiative forcing of CO2, CH, and N,O (ref. 31) using a simplified expression’”. 
d, Atmospheric CH, concentrations (this study and ref. 11). e, Oxygen isotope 
measurements from the North Greenland Ice Project® (NGRIP). Coloured 
bands at bottom indicate times when CO, is stable (blue), slowly increasing 
(pink) or rapidly increasing (red), as described in the text. LGM, Last Glacial 
Maximum; HS1, Heinrich stadial 1 (~18.0-14.6 kyr ago); B/A, Bolling- 
Allergd; YD, Younger Dryas; Hol, Holocene. 


The increase in CO, from glacial to interglacial states is typically 
explained by a combination of changes in deep-ocean ventilation, Ant- 
arctic sea-ice cover, ocean salinity and temperature, marine biological 
surface nutrient utilization, changes in ocean alkalinity, and carbonate 
compensation in the ocean’’. These processes are generally thought to 
take place at millennial or longer timescales. However, our new obser- 
vations from WDC demonstrate that in several instances the observed 
atmospheric CO, change was rapid, on centennial timescales; that the 
most rapid changes correlate in time with ice-rafted debris events in the 
North Atlantic or with abrupt changes in the AMOC (Fig. 2); and that 
the abrupt changes in CO) do not always have a centennial counterpart 
in the Antarctic 5'8O record (Fig. 1). 

Given these new observations, we suggest that the AMOC had a 
critical role in the glacial-interglacial rise of atmospheric CO, and may 
have acted as a modulator of both the timing and the magnitude of 
CO, and CH, change during the last termination, either directly, through 
changes in ocean circulation, or indirectly, through the redistribution 
of heat and the associated climate response. Elements of this hypothesis 
have been outlined previously: a reduction in the AMOC is thought to 
cause increased overturning in the Southern Ocean, which allows CO, 
in the deep ocean to enter the ocean mixed layer more rapidly, where it 
can exchange with the atmosphere'®. For methane, a cooling in the North- 
ern Hemisphere resulting from a reduction in the AMOC and a south- 
ward expansion of sea-ice extent may have displaced the intertropical 
convergence zone southwards” resulting in decreased production of 
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Figure 2 | WAIS Divide CO, and CH, data plotted against multiple 
environmental proxies. a, b, WAIS Divide CH, (a) and CO; (b). ¢, Iceberg 
discharge indices from the North Atlantic” (15-record composite; normalized 
units). IRD, ice-rafted debris. d, e, Biological flux proxies from the Southern 
Ocean"® (d; TN57-13PC and E27-23) and off New Zealand” (e; MD97-2120). 
f, g, Precipitation indices in speleothem records from Borneo” (f) and Hulu’ 
(g). h, Carbon isotopes of CO) (line with 1o uncertainty band) from a multi- 
core composite* placed on an updated chronology’. 88°C =(8C/ 2G) tne! 
(°c/?C)vppp - 1; VPDB, Vienna PeeDee Belemnite. i, An AMOC proxy’ 
(GGC5). All age models that use radiocarbon dates were recalibrated with 
IntCal13 and updated (Supplementary Data). Coloured bands at bottom of plot 
indicate times when CO), is stable (blue), slowly increasing (pink) or rapidly 
increasing (red), as described in the text. The grey and brown bands denote the 
timing of notable climatic transitions (for example LGM to HS1). 


methane from mid- and low-latitude sources, and vice versa when the 
Northern Hemisphere warmed. Several studies have proposed other 
complementary mechanisms (see, for example, ref. 18) for the CO, 
changes across the last termination; here we guide the discussion with a 
simple working hypothesis that the AMOC had a critical role, either 
directly or indirectly, in the rise of atmospheric CO. 

The gradual changes in CO) that begin at 18.1 and 13.0 kyr ago are 
accompanied by a reduction in the AMOC strength” and warming in 
Antarctica’! (Figs 1 and 2). Like CO», methane begins to increase grad- 
ually at ~ 18 kyr ago (50 p.p.b. kyr’ '), yet at 13.0 kyr ago it is decoupled 
from CO), and instead decreases by 200 p.p.b. in two centuries before 
stabilizing during the Younger Dryas (Fig. 1). The CO; rise and abrupt 
CH, drop at 13 kyr ago are synchronous within the resolution of the 
data (20-60 yr), and both rise almost synchronously at 18 kyr ago (Ex- 
tended Data Table 1), suggesting that a common forcing drives the 
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changes in both gases. Freshwater hosing experiments with a coupled 
atmosphere-ocean-biogeochemical model” show that atmospheric CO, 
begins to rise because of a decrease in the ocean carbon inventory due 
toa collapse in the AMOC, and continues rising until the AMOC resumes. 
The rate of CO; rise in the model”® (25 p.p.m. in 1,700 yr) is similar to 
rates observed from WDC, and the hypothesis that CO, fluctuations 
are controlled by ocean circulation change is consistent with the timing 
of the AMOC reduction based on ocean proxies (Fig. 2). Two significant 
depletion events in stable carbon isotope ratios of CO, from a com- 
posite of multiple ice cores*, by 0.3%o and 0.1%o, are consistent with the 
source of the CO, rises at 18.1 and 13.0 kyr ago being from upwelling of 
respired carbon from the deep ocean (Fig. 2h). 

The most abrupt changes in CO) during the last termination occur 
at 16.3, 14.8 and 11.7 kyr ago and are synchronous with abrupt increases 
in CH, within the 20-60 yr resolution of the data (Fig. 3). No relation- 
ship between the carbonate chemistry of the ice and CO, concentration 
is observed that would suggest in situ production of CO; as observed in 
Greenland (Extended Data Fig. 2), and the sharp increases are resolved 
by multiple data points, each multiply replicated. Moreover, the WDC 
CO, record is consistent with results from the EDC ice core**”* after 
accounting for the greater smoothing of the EDC gas record by the in- 
creased diffusion in the firn and the gradual bubble closure character- 
istic of low-accumulation sites (Extended Data Fig. 5b). For all three of 
the abrupt CO, increases, the concomitant CH, rise is even more sud- 
den, suggesting that the rate of the CO, increase in the WDC record is 
not limited by firn smoothing processes, but reflects the true atmo- 
spheric rate of change. 

For the abrupt increases at 14.8 and 11.7 kyr ago, both CO, and CH, 
increased by ~13 p.p.m. and ~200 p.p.b., respectively, in 100-200 yr, and 
then remained at the new levels. For these two events, the increases are 
coincident with proxy evidence for an abrupt recovery of the AMOC 
and a rapid warming of the North Hemisphere as recorded by Green- 
landic ice cores (Fig. 2). As previously noted, modelling suggests that 
slow increases in CO) can occur when the AMOC is reduced”. Conversely, 
similar modelling supports the hypothesis that a rapid resumption of 
the AMOC can result in an abrupt increase in atmospheric CO, and CH, 
through changes in carbon storage on land”! and increased precipitation 
in wetland regions. Other modelling studies” suggest that the CO, in- 
crease at the onset of the Bolling warming could have been much higher 


given the large smoothing at EDC. Such an overshoot in CO) is not 
recorded at the Bolling onset at WDC, but smaller overshoots at 11.7 
and 16.3 kyr ago do lend partial support to this mechanism. 

At 16.3 kyr ago, CO, and CH, also abruptly increased by 12 p.p.m. 
and 50 p.p.b. in 100 yr, respectively, although the elevated methane levels 
were short lived (Fig. 3). However, unlike the changes at 14.8 and 11.7 kyr 
ago, these increases show no apparent evidence for an abrupt AMOC 
resumption or warming in the Northern Hemisphere. We propose that 
the abrupt rise in CO, at 16.3 kyr ago is directly related to an iceberg dis- 
charge event recorded in several ocean cores from the North Atlantic”, 
which occurred while the AMOC was already reduced (Fig. 2c). One 
potential cause of the CO, increase at this time could be a southward 
shift in the position of the Southern Hemisphere westerlies™, leading to 
increased ocean upwelling and outgassing of respiration-derived CO, 
(ref. 18). Other archives preserve geochemical signals attributed to the 
iceberg discharge event at 16.3 kyr ago, for instance an increase in up- 
welling in the Southern Ocean"* (Fig. 2d) and off the coast of New Zea- 
land” (Fig. 2e), drying in northern Borneo” (Fig. 2f), and an inferred 
sharp decrease in monsoon intensity in central China” (Fig. 2g). Cooling 
and drying of large regions of the globe may also be a consequence of 
iceberg discharge and freshwater input into the North Atlantic’*, which 
could have initiated a rapid release of carbon from land stocks”. 

The small increase in methane at the time of the CO, rise at 16.3 kyr 
ago is somewhat puzzling because a southward shift in the intertropical 
convergence zone related to an iceberg discharge event would be expected 
to suppress northern tropical sources of CHy. However, observations 
from cave deposits suggest wetter conditions at southern equatorial sites 
during Heinrich stadials*®, which could compensate for reduced north- 
ern sources”’ and explain the small CH, increase. 

Finally, the apparent plateaux in CO, concentration that occur directly 
after each of the abrupt CO, increases at 16.3, 14.8 and 11.7 kyr ago 
(Fig. 1) deserve explanation. These plateaux last for 1,000-1,500 yr, and 
the plateaux following the abrupt changes at 14.8 and 11.7 kyr ago are 
contemporaneous with reinvigorated AMOC”, cooling in Antarctica® 
and a decreased north-south gradient in global temperature’. One pos- 
sible explanation is stratification of the Southern Ocean, perhaps in 
response to large freshwater discharges from the Antarctic ice sheet”®, 
which would lead to a reduction in outgassing of CO, to the atmosphere 
through decreased exchange of deep water with the surface ocean and 
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Figure 3 | Detailed view of greenhouse gas and stable isotope measurements 
from WDC. Oxygen isotope measurements from WDC" (black line is 

11-point weighted average), water-isotope-derived temperature composite from 
East Antarctica® (green), and atmospheric CH, (purple) and CO concentrations 
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(black line is 5-point weighted average) across the abrupt CO, transition at 
16.3 (a), 14.8 (b) and 11.7 kyr ago (c). Purple bands represent the durations of 
the abrupt CO, transition and orange lines show their magnitudes (durations 
and magnitudes also in parentheses; Extended Data Table 1). 
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atmosphere. Alternatively, the plateaux in CO, could be associated with 
the slow equilibration of atmospheric CO, with the ocean following the 
input of relatively large pulses of carbon during the CO, jumps (Ex- 
tended Data Fig. 5), or uptake of atmospheric CO; by the slow regrowth 
of the terrestrial biosphere. 

Our results provide evidence that the observed CO, rise during the 
last deglaciation was divided into two modes: a slow, millennial-scale 
mode that is closely linked to Antarctic temperature and associated with 
5'°CO, decreases (Fig. 2), suggestive of an oceanic origin; and a fast, 
centennial-scale mode that is closely linked to CH, and Northern Hemi- 
sphere climate, potentially controlled by AMOC strength and the assoc- 
iated climate teleconnections. Given that a large portion of the direct 
radiative CO, forcing during the glacial—interglacial rise occurred in 
approximately four centuries, future modelling and isotopic investi- 
gations should focus on explaining these abrupt CO) variations to fur- 
ther our understanding of the sensitivity of the carbon cycle under 
natural forcing. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Measurements of CO, were made at Oregon State University, and CH, measure- 
ments were made at Oregon State and Penn State University. CO, measurements 
were made with a mechanical crushing system using methods described by Ahn 
et al’. Approximately 1,030 measurements were made on ~320 separate depths 
spanning the time frame of 23,000-9,000 years BP with a median sampling reso- 
lution of 25 years. All samples were measured at least in duplicate and one to three 
times more for some depths. Nitrogen isotope data were used to correct the CO, 
data by 1.0-1.4 p.p.m. for gravitational fractionation. The mean standard error ofall 
measurements was 1.0 p.p.m. A consistent 4 p.p.m. offset between WDC and EPICA 
Dome C CO, concentrations is observed. A similar offset is observed between W AIS 
Divide and other CO, reconstructions**. Methane concentrations were measured 
for approximately 1,100 ice samples from ~580 depths covering 23,000-11,500 
years BP (20-year median resolution) using methods described by Mitchell et al.** at 
Oregon State University and combined with data measured at Penn State University". 
Duplicate methane measurements were made on all but fifty samples, and the 
mean standard error of all measurements made at Oregon State University was 
0.8 p.p.b. CO, and CH, measurements were calibrated using dry standard air with 
known mole fractions calibrated at the NOAA Earth System Research Laboratory 
(WMOX2007 scale for CO2; NOAA04 scale for CH,). 

WAIS Divide timescale. The gas chronology used in this study (WDC06A-7) is 
based on the layer-counted WDC06A-7 chronology’. Gas ages were obtained by 
subtracting a modelled ice-age/gas-age difference (Aage) from the ice-age time- 
scale**. Aage was calculated using a dynamical firn-densification model with heat 
diffusion. The modelling is constrained by measurements of 5'°N of Np, a proxy 
for past firn column thickness*”. 5'°N was measured on the WAIS Divide ice core 
(WDC) at the Scripps Institution of Oceanography with a 300-year average reso- 
lution (Extended Data Fig. 1b) following procedures outlined elsewhere**’. The 
air was extracted from ~13 g ice samples using a melt-refreeze technique, and 
collected ona cold finger at liquid-He temperatures. The samples were analysed on 
a Thermo Finnigan Delta V dual-inlet mass spectrometer to determine '°N/"4N of 
Nz (8'°N), '80/1°O of O, (5'80), 370,/78N> (802/N>) and *°Ar/*8N, (SAr/N)). 
Routine analytical corrections were made to the data for pressure imbalance and 
chemical slope, and results were normalized to the La Jolla present-day atmosphere. 

We use a dynamical adaptation of the Herron and Langway firn-densification 
model” coupled to a heat diffusion—advection model". We use the present-day con- 
vective zone thickness of 3.5 m and a parameterization for surface snow density”. 
The forward model requires past temperatures and accumulation rates as inputs, 
and generates Aage and 5'°N as outputs. Surface temperatures (Extended Data 
Fig. 1a) are obtained from the 5'°O isotopic composition of the ice", with the 5'°O- 
temperature relationship calibrated'*“** to fit observed borehole temperatures. 
Accumulation rates were obtained in two ways. First, accumulation can be recon- 
structed from the observed annual layer thickness, corrected for thinning due to ice 
flow (Extended Data Fig. 1c, black curve). However, the thinning function becomes 
increasingly uncertain with depth (the CO, data presented here were measured 
over the depth range of 1,700-2,600 m within the 3,405 m-long core). Second, accu- 
mulation can be reconstructed from the 5!°N data using a firn-densification inverse 
model*’. Using the temperature history, the model determines the accumulation 
history that optimizes the fit to the 8'°N data by making small adjustments to the 
thinning function (Extended Data Fig. Ic, red curve). The model fit to the 5'°N 
data is shown in Extended Data Fig. 1b. We find a good agreement between both 
accumulation rate reconstructions. 

The modelled Aage is shown in Extended Data Fig. 1d (orange curve), where we 
have used the accumulation rates from the inverse 5'°N method. A sensitivity 
study in which we varied past convective zone thicknesses, temperature history, 
dust sensitivity** and firn-densification physics”” shows a lo uncertainty in Aage 
of ~15%. We also calculated Aage using the depth-difference method developed 
by Parrenin et al.° (Extended Data Fig. 1d, black curve). This method also suffers 
from uncertainty in the layer thinning function. Both methods agree within the lo 
uncertainty band. We note that at around 12 kyr Bp the WDC site experienced a 
very sudden and short-lived increase in accumulation rates as inferred from the 
annual layer thicknesses (Extended Data Fig. 1c). This event is also recorded in the 
5'°N as a thickening of the firn column (Extended Data Fig. 1b). 

Potential for in situ production of CO. Prior work in Greenland® has demon- 
strated that CO, records can be compromised through in situ production when 
abundant concentrations of calcium carbonate are present. Calcium is delivered to 
the ice in the form of carbonate dust that blows onto the ice-sheet surface and is 
buried by subsequent accumulation. Although problems with in situ production 
have never been detected in CO, records from Antarctic ice cores, probably because 
Antarctic sites experience an order of magnitude less calcium carbonate deposition 
than Greenland“, each ice core from Antarctica is both geographically unique and 
affected by distinct atmospheric processes that can deliver various chemical species 


to the sites. To examine whether abrupt changes in CO2 in WDC are related to in situ 
production, we compared the non-seasalt component of calcium, which we take 
as a proxy for terrestrial dust, with the carbon dioxide concentration at times of 
abrupt change in CO, (Extended Data Fig. 2). We find no relationships and note 
that the non-seasalt Ca also is an order of magnitude lower at WDC than in Greenland 
ice cores, similar to other ice cores from Antarctica®“*. In addition to calcium car- 
bonate, the oxidation of organic compounds can also affect the CO2 concentration 
from ice cores’. At WDC the concentration of the oxidant HO, in the upper 
section of the core (WDC-05A) averaged less than 30 p.p.b.°°”', which is similar to 
other values from Antarctic ice cores”, and does not covary with CO, concentra- 
tions over the interval from 1,940 to 2,000 m (average 20 p.p.b.) in the deeper section 
of WDC from where we have data. 

Smoothing of the CO, record by firn processes. Although it may appear that the 
WDC and EDC CO; records exhibit fundamentally different patterns over short 
timescales (Extended Data Fig. 3), these differences are expected as a result of the 
dramatically different conditions at the ice-core sites, which dictate how much 
smoothing atmospheric gas records experience before being locked into the ice- 
core record. Temperature and accumulation rates at WDC vary between 233 and 
243 K and, respectively, between 0.10 and 0.30 m ice equivalent per year over the 
deglacial transition. At EDC, in contrast, conditions are colder and dryer, aver- 
aging 209 K and 0.015 myr’_'. Because of this, firn models predict that Aage and 
the age distribution (a measure of smoothing) should be an order of magnitude 
smaller at WDC than at EDC (Extended Data Fig. 4). 

To demonstrate the consistency of the two CO) records, we generate a smooth- 
ing function for EDC using the OSU firn air model® (Extended Data Fig. 5a). The 
function describes the age distribution of the air in the closed porosity of the ice 
core, and accounts for gradual bubble closure as ice is advected through the lock-in 
zone. We then apply this filter to the WDC gas record, using it as a proxy for the 
true atmospheric history. Because WDC gases experience very little smoothing 
during bubble trapping, we argue that this is a reasonable assumption (smoothing 
WDC with a WDC filter does not change the record). The resulting curve agrees 
well with the EDC CO, record”’*** (Extended Data Fig. 5b), supporting the idea 
that differences between the two records arise because of differences in site con- 
ditions and bubble-trapping processes, and reflect no disagreement about the 
nature of the deglacial rise in atmospheric CO). 

A consistent 4 p.p.m. offset between WDC and EPICA Dome C CO, concen- 

trations is observed in our new deglacial reconstruction after consideration for firn 
smoothing processes (Extended Data Figs 3 and 5). This offset represents 1-2% of 
the total concentration measurements and is outside the stated uncertainties for 
both records. A similar offset of 2-4 p.p.m. has also been observed between WDC 
and CO), reconstructions from Law Dome and EPICA Dronning Maud Land*4, 
along with a 2 p.p.m. offset between Dronning Maud Land and the South Pole”. 
On the basis of interlaboratory comparisons between Law Dome and WDC™, the 
offset between the ice cores is unlikely to result from laboratory methods and rep- 
resents a real difference between the two ice cores, though the exact nature of the 
offset still remains to be determined. 
Modelling CO, pulses and plateaux. Using the WDC CO, record as a proxy for 
the true atmospheric history, we employ a box model of the ocean carbon cycle” to 
explore how a pulse of CO, emissions is attenuated and potentially creates a plateau 
in the concentration record. The model is divided into three boxes: a well-mixed 
atmosphere, a well-mixed surface ocean mixed layer and a diffusive thermocline/ 
deep ocean. The model does not account for sedimentation of CaCO; in the ocean. 
To simulate an abrupt increase in CO,, we apply a pulse input of 0.5 Pg Cyr! that 
lasts for 100 years. Following the 100-year pulse, the model simulates a slow ocean 
uptake over the next 1,500 years (Extended Data Fig. 6). Although this model is not 
designed to simulate the entire CO, variability in the WDC record, this simple 
simulation suggests (but does not require) that the three abrupt CO; rises at 16.3, 
14.8 and 11.7 kyr ago could represent brief injections of CO) into the atmosphere 
that are superimposed ona longer-term increase, with the plateau that follows each 
of the abrupt rises explained by the drawdown of the additional carbon in the 
atmosphere into the ocean. 

Kohler et al." performed a similar experiment using a more comprehensive carbon 
cycle box model to describe an abrupt CO, change at the onset of the Bolling. Using 
similar modelled pulses of CO, they also reproduced the abrupt CO; features. They 
concluded that the magnitude of the CO, change could have been as high as 30 p.p.m. 
fora 2.5 Pgyr ‘injection of carbon into the atmosphere after considering smooth- 
ing of the atmospheric signal in the firn at EDC. Unlike the low-accumulation site 
at EDC, however, the high-accumulation WDC site provides a CO record that has 
experienced very little firn smoothing (see Methods section on the WAIS Divide 
timescale), and provides a much closer approximation of the true atmospheric his- 
tory. Therefore, we conclude that if there were a pulse input of CO) its magnitude 
and duration would be of the order of 0.5 Pg Cyr‘ and would last for 100 years. 
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Breakpoint determination. To provide an objective means for determining the 
timing and duration of the CO, and CH, transitions across the last deglaciation, 
we employed the functions Breakfit*° and Rampfit*’. For the three abrupt jumps in 
CO), both the timing and duration of the changes were quantified (Extended Data 
Table 1) using the Rampfit”’ function with a 500-year search window. For each, the 
same seed generator number (200) was used, and 200 bootstrap (wild) iterations 
were performed to determine the uncertainties. For the CO2 and CH, transitions 
at the start of the deglaciation and beginning of the Younger Dryas, the Breakfit”® 
function was used to determine the timing of the transitions. The window length 
for each was 1,000 years and the same seed number (200) and bootstrap number 
(200) were used to quantify the timing uncertainty. 
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Extended Data Figure 1 | 5'°N and the ice-age/gas-age difference for the observations (black curve). c, 5'°N-N, data for the upper 2,800 m (black dots) 
WDC. a, Borehole calibrated surface temperature reconstruction derived from — with model fit (green curve). d, Modelled age using firn-densification model 
5'8O measurements from the ice’. b, Accumulation rates reconstructed with (orange curve) and Aage estimate using the depth-difference technique from 
the firn-densification inverse model (red curve) and from layer thickness Parrenin et al.’ (black curve). 
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Extended Data Figure 2 | CO, concentrations and elemental data for WDC. _in the core where we observe abrupt changes in carbon dioxide. Hydrogen 
WDC CO, concentrations (blue) plotted against non-seasalt calcium (nssCa) _ peroxide concentrations have been smoothed (2 m centred average) from 
concentrations (black) and hydrogen peroxide (H,O, red) at multiple depths _ original data to improve clarity. 
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Extended Data Figure 3 | CO, concentrations for WDC and EDC. WDC Lemieux-Dudon et al.?'*** (brown), Parrenin et al.° (red) and Antarctic 
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Extended Data Figure 4 | Calculated Aage offsets across the last deglacial 
termination for five ice cores from Antarctica and Greenland, compared 
with WDC. EDML, EPICA Dronning Maud Land; TALDICE, Talos Dome 
Ice; NGRIP, North Greenland Ice Project. Ice-core data from refs 58, 59. 
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Extended Data Figure 5 | Firn smoothing functions applied to CO data EDC (dots) plotted against artificially smoothed CO; data from WDC using the 
from WDC and EDC. a, The red line is the Green’s function (smoothing EDC firn smoothing function (red line in both plots). WDC data have been 
function) produced by a firn model using an assumed EDC accumulation rate —_ systematically lowered by 4 p.p.m. for direct comparison with EDC. 
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Extended Data Figure 6 | Simple box model source history and atmospheric | WDC (5-point weighted average; blue) and the model derived atmospheric 
CO, response compared to measured data from WDC. a, Applied source history (black). Box model from ref. 55. 
history used in the modelling experiment. b, Atmospheric CO, record from 
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Extended Data Figure 7 | CO, concentrations and temperature band is 1o uncertainty envelope), and an Antarctic temperature stack based on 
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Extended Data Table 1 | Timing of five abrupt transitions in CO2 and CH, during the last termination 
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Calcisponges have a ParaHox gene and dynamic 
expression of dispersed NK homeobox genes 


Sofia A. V. Fortunato’, Marcin Adamski!, Olivia Mendivil Ramos*+, Sven Leininger't, Jing Liu', David E. K. Ferrier? 


& Maja Adamska! 


Sponges are simple animals with few cell types, but their genomes 
paradoxically contain a wide variety of developmental transcription 
factors’ *, including homeobox genes belonging to the Antennapedia 
(ANTP) class**, which in bilaterians encompass Hox, ParaHox and 
NK genes. In the genome of the demosponge Amphimedon queens- 
lJandica, no Hox or ParaHox genes are present, but NK genes are 
linked in a tight cluster similar to the NK clusters of bilaterians’. It 
has been proposed that Hox and ParaHox genes originated from NK 
cluster genes after divergence of sponges from the lineage leading to 
cnidarians and bilaterians”’. On the other hand, synteny analysis lends 
support to the notion that the absence of Hox and ParaHox genes in 
Amphimedonisa result of secondary loss (the ghost locus hypothesis)*. 
Here we analysed complete suites of ANTP-class homeoboxes in two 
calcareous sponges, Sycon ciliatum and Leucosolenia complicata. Our 
phylogenetic analyses demonstrate that these calcisponges possess 
orthologues of bilaterian NK genes (Hex, Hmxand Msx), a varying 
number of additional NK genes and one ParaHox gene, Cdx. Despite 
the generation of scaffolds spanning multiple genes, we find no evi- 
dence of clustering of Sycon NK genes. All Sycon ANTP-class genes 
are developmentally expressed, with patterns suggesting their involve- 
ment in cell type specification in embryos and adults, metamorphosis 
and body plan patterning. These results demonstrate that ParaHox 
genes predate the origin of sponges, thus confirming the ghost locus 
hypothesis*, and highlight the need to analyse the genomes of mul- 
tiple sponge lineages to obtain a complete picture of the ancestral 
composition of the first animal genome. 

Sponges (Porifera) are strong candidates for being the earliest extant 
lineage(s) of animals’. The genome sequence of the demosponge A. 
queenslandica has provided rich material for comparative studies look- 
ing at the origins of metazoan developmental genes, cell types and body 
plans’. Among others, it has fuelled hypotheses about the origin of one of 
the most widely studied groups of developmental genes: the ANTP-class 
homeoboxes, including Hox, ParaHox and NK genes*”*?*"!,. ANTP-class 
genes have been found in all animals and are involved in multiple devel- 
opmental processes, including body plan patterning and neurogenesis”. 
Hox, ParaHox and NK genes are often found in clusters®”’, and in some 
animals their expression is temporally or spatially correlated to their 
position within the cluster (this being known as colinearity)’*. The Amphi- 
medon genome contains eight NK genes, but neither Hox nor ParaHox 
genes are present”. Six NK genes are linked in a tight cluster, and their 
simple embryonic and larval expression patterns are not consistent with 
colinearity”*. The lack of Hox and ParaHox genes in Amphimedon, and 
also in the ctenophore Mnemiopsis leidyi’, has previously been inter- 
preted as reflecting the ancestral condition, and gave rise to the Para- 
Hoxozoa hypothesis, in which all animal lineages apart from poriferans 
and ctenophores are collectively known as the ParaHoxozoa. Others'® 
have interpreted the phylogenetic evidence differently, suggesting that 
both Hox and ParaHox genes were originally present in sponges, but 
have subsequently been lost. This view has been recently revived by the 


identification of Hox and ParaHox ‘ghost loci’ (regions that display con- 
served synteny with bilaterian Hox and ParaHox loci, but lack the Hox/ 
ParaHox genes themselves) in the genome of Amphimedon’*. 

Weexpected that expanding the range of sequenced sponge genomes 
would provide new information about the evolutionary history of genes 
important for the origin and evolution of the animal kingdom. Calci- 
sponges form a poriferan lineage that has been separated from demo- 
sponges for at least 600 million years’. We recently started analysis 
of the developmental toolkits of two calcisponges, S. ciliatum and L. 
complicata**"*. Here, we searched for ANTP-class homeobox genes in 
the transcriptomic and genomic assemblies of these species. 

We retrieved ten ANTP-class homeodomains in Sycon, constituting 
nine transcripts (one with two homeoboxes), and twelve ANTP-class 
homeodomains in Leucosolenia, constituting nine transcripts (one with 
four homeoboxes) (Supplementary Data 1). Our phylogenetic analyses 
demonstrate that the repertoire of ANTP-class genes is similar between 
the two calcisponges, but strikingly different than in the demosponge 
Amphimedon. Calcisponges and demosponges have clear orthologues 
of the bilaterian genes Hex and Msx; calcisponges also have a clear NK5 
(also known as Hmx) orthologue, which seems to be lacking in Amphi- 
medon. By contrast, this demosponge has possible Bsh, BarH and Tlx 
genes, which are not recognizable in calcisponges. While in Amphimedon 
there is a single gene associated with the bilaterian NK2/3/4 clade”, 
several paralogues are present in the two calcisponges. They contain 
multi-homeobox genes with non-orthologous relationships between 
Sycon and Leucosolenia, and other genes containing single homeoboxes 
in the NK2/3/4 clade. Affiliation of Sycon and Leucosolenia NKB and 
NKG and the Leucosolenia (Lco)NKF genes with a particular bilaterian 
NK family is not clear. No Hox genes were found; however, a pair of the 
calcisponge genes showed affinities with the ParaHox Cdx family, given 
the concordance of the results of neighbour-joining and maximum- 
likelihood analyses (Fig. 1 and Extended Data Fig. 1). 

Given the importance of this potential assignment, we performed 
further phylogenetic analyses of these putative Cdx orthologues. In addi- 
tion to the ELEKEF motif, which is shared by many Hox and ParaHox, 
but not NK-type homeodomains, the Cdx family has some distinctive 
residues in its homeodomain, most notably the YIT motif, which is 
present only in a small number of other ANTP-class homeodomains 
(Supplementary Note 1 and Extended Data Fig. 2). Phylogenetic ana- 
lyses focused on these few families, in addition to families represented 
in sponges—on the basis of greater taxon sampling than in the overall 
classification—produced a significantly supported clustering of Sycon 
(Sci)Cdx and LcoCdx with Cdx genes from other species in neighbour- 
joining, maximum-likelihood and Bayesian analyses (Extended Data 
Figs 3-5). 

We also investigated the genomic neighbourhood of SciCdx to help 
resolve the identity of this homeobox gene (Fig. 2, Supplementary Note 2 
and Supplementary Table 1). From the 14 genes on the SciCdx scaffold 
that have clear human orthologues (Supplementary Table 1), four are 
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Figure 1 | Phylogenetic tree of the ANTP-class homeodomains. A 
neighbour-joining tree is displayed. Three support values are shown: the black 
value is the neighbour-joining bootstrap support, the blue value is the 
maximum-likelihood bootstrap support and the red value is the posterior 
probability from Bayesian analysis. Bootstrap values below 10% and posterior 
probability values below 0.5 are not shown except for associations of 
calcisponge sequences. The root was determined by using selected PRD-class 
homeodomains as an outgroup. Aqu, A. queenslandica; Bfl, Branchiostoma 
floridae; Tca, Tribolium castaneum. Scale bar indicates the number of amino 
acid substitutions per site. 


orthologues of genes linked to ParaHox loci in humans. One of these, 
SciSar1A/B, also has a conserved neighbouring relationship with the 
ParaHox cluster in the cnidarian Nematostella vectensis (Fig. 2a, b). 
Although these gene numbers are insufficient to reach statistical signi- 
ficance, the neighbour relationships are consistent with the identifica- 
tion of SciCdx as a ParaHox gene. Furthermore, as one would expect 
from the ghost locus hypothesis and the identification of SciCdx as a 
bona fide ParaHox gene, we also find clustering of Sycon orthologues 
of ParaHox and Hox neighbour genes into two distinct groups in the 
Sycon genome to statistically significant levels (Fig. 2c—e). Altogether, 
the evidence is consistent with the identification of SciCdx and LcoCdx 
as the first examples of sponge ParaHox genes. 

All of the Sycon NK genes are found on separate scaffolds (Extended 
Data Fig. 6), with multiple additional genes surrounding the homeobox 
genes. We interpret this as the ancient NK cluster having been broken 
apart in the Sycon genome. Alternatively, our current assembly is not 
sufficient to provide evidence of a cluster with multiple genes inserted 
between the NK genes. It has been previously shown that arrangements 
of NK genes are variable between different species, ranging from intact 
and conserved NK clusters*'*”” to clusters that are partially broken’*’. 

Westudied the expression of ANTP-class genes in Sycon using a com- 
bination of in situ hybridization with quantitative transcriptome ana- 
lysis (Fig. 3, Supplementary Note 3 and Extended Data Figs 7-9). For 
all ANTP-class genes, except SciHex, expression can be detected in oocytes 
and during cleavage (Fig. 3a and Extended Data Fig. 7a-g). During 
embryogenesis, the most striking expression domain of the majority of 
the identified genes is in the cruciform cells, which are putative larval 
sensory cells*'*. Beginning at the four-cell stage, stronger expression of 
SciNKA marks the cytoplasm destined to become partitioned into the 
cruciform cells (Fig. 3d and Extended Data Fig. 7h-q), and expression 
of SciHmx is also markedly elevated in these cells (Fig. 3e). SciNKC and 
SciNKD are uniquely and strongly expressed in the cruciform cells of 
more advanced (pre-inversion stage) embryos (Fig. 3f, g). SciNKA is 


Figure 2 | SciCdx synteny and 
ghost loci simulations. 
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Figure 3 | Expression of S. ciliatum ANTP-class genes. a, Adult S. ciliatum. 
b, Expression heat map. Asterisks indicate statistically higher*””’ expression in 
the apical/top region in comparison to middle (M) or basal (B) parts. 
Drawings reproduced from ref. 14. INF, positive infinity; repr., reproductive. 
c-k, S. ciliatum oocytes (c), cleavage (d, e) and pre-inversion embryos (f-k). 
The formation of cross cells (cc) expressing multiple ANTP-class genes is 
indicated by arrows (d-h) or rainbow colouring (k). Macromeres (mac) 


additionally detected in macromeres of embryos and larvae, and along 
with SciNKG and SciNKB domains, forms a set of adjacent stripes along 
the larval anterior—posterior axis (Fig. 3h—-p). This pattern is reminis- 
cent of ‘striped’ patterns reported for NK genes in bilaterians’”, and might 
be indicative of roles for the calcisponge NK genes in axial patterning 
of the larval body plan or in cell type determination, with cells destined 
for specific fates distributed along the larval axis. For example, the mac- 
romeres give rise to the pinacocytes of the outer cell layer”, and the 
SciNKG-positive micromeres are good candidates for future sclero- 
cytes (spicule-producing cells), given the co-expression of SciNKG and 
sclerocyte-specific carbonic anhydrases”’. All of the ANTP-class genes 
except SciNKC and SciNKD are expressed during metamorphosis (Fig. 3b 
and Extended Data Fig. 9) in subpopulations of cells in all three cell layers 
(Extended Data Fig. 7). The clear expression of SciCdx in the inner cell 
mass during formation of the choanocyte chamber (Fig. 3q) is particu- 
larly striking in light of the recently revived notion of homology of the 
sponge choanoderm with bilaterian endoderm™, as ParaHox expres- 
sion in bilaterians is often associated with the developing gut. In adults, 
most of the ANTP-class genes display differential expression along the 
body axis (Fig. 3b and Extended Data Table 1). SciNKG and SciNKA are 
strongly expressed in sclerocytes, while SciMsx and SciHm<x transcripts 
are predominantly detected within and around the oscular sphincter 
(Fig. 3r-w). 

In summary, analysis of ANTP-class genes in a previously understudied 
lineage of sponges allowed us to demonstrate pre-poriferan ancestry of 
ParaHox genes, thus confirming the ghost locus hypothesis and reject- 
ing the ParaHoxozoa hypothesis of Hox/ParaHox gene origins. Expres- 
sion patterns of the identified genes indicate that developmental functions 
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expressing SciNKA are indicated by asterisks (f-o) or blue colouring (k, p). 
Equatorial micromeres (mic) expressing SciNKG and SciNKB are indicated in 
red (k). I-w, Post-inversion embryos and larvae (I-p), post-larva (q), and top 
parts of sponges (r-w) are shown. SciNKG and SciNKA in sclerocytes, and 
SciMsx and SciHmx in cells of the oscular sphincter are indicated with arrows. 
Scale bars: 10 um (c-h, s, u); 25 jum (1-o); 50 um (q, r, t, v, w). 


of ANTP-class genes also predate poriferans, with probable involvement 
in the specification of potentially homologous structures (choanoderm/ 
endoderm and cross cells/sensory cells), as well as morphological nov- 
elties (calcareous spicules). Differences in ANTP-class gene repertoires 
between the demosponge Amphimedon and the two calcisponges Sycon 
and Leucosolenia are striking, and the fact that both classes of sponges 
share a subset of genes with bilaterians indicates independent gene loss 
events in the two poriferan lineages. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Genome and transcriptome assemblies will be described in detail elsewhere (M. 
Adamski, S.L. and M. Adamska, unpublished observations). Briefly, the high-quality 
draft genome assembly of S. ciliatum was generated using two (360 bp and 530 bp) 
paired-end libraries and several mate-pair libraries ranging from 2.0 to 9.0 kb, and 
the preliminary draft assembly of L. complicata was generated from a single 295 bp 
paired-end library, all prepared and sequenced by Illumina technology. Assembly 
was performed using SOAPdenovo2? (ref. 24) and scaffolding was performed using 
SSPACE v.2.2 (ref. 25), and resulted in the length of the shortest scaffold for which 
scaffolds of equal or greater length contain at least half of the nucleotides in the 
genome (N50) = 150 kb and 450 bp for S. ciliatum and L. complicata, respectively. 
Transcriptomes were assembled using Trinity”®. For S. ciliatum, genomic scaffolds 
and transcripts of sponge origin (as opposed to those derived from associated organ- 
isms) were identified by aligning the resulting assembly to reads from an Illumina- 
sequenced library obtained from laboratory-grown, eukaryotic-contamination-free 
juveniles. The calcisponge ANTP-class sequences were retrieved from these assem- 
blies using TBLASTN with representative query homeodomain sequences from A. 
queenslandica, Mus musculus, T. castaneum and B. floridae. For phylogenetic anal- 
ysis, we selected B. floridae and T. castaneum to provide a framework for the classi- 
fication of the sponge sequences, as these species have been shown to collectively 
contain homologues of all major bilaterian ANTP-class genes”. Their homeodomain 
sequences were extracted from HomeoDB™. Prottest3.0 (ref. 29) and Modellgenerator 
v.0.85 (ref. 30) were used to determine the best suitable model of sequence evolution 
(LG+G). Phylogenetic analyses were based on neighbour-joining (Phylip v.3.69), 
maximum-likelihood (PhyML v.3.0) and Bayesian inference (MrBayes v.3.1.2) meth- 
ods. Gene expression was studied using available packages”. S. ciliatum ANTP- 
class gene amplification, cloning, sequencing, probe production and single in situ 
hybridization were performed as described previously”. In the double in situ exper- 
iment, samples were hybridized simultaneously with digoxigenin-labelled SciNKB 


probe and fluorescein-labelled SciNKG probe. After detection of the digoxigenin- 
labelled probe with NBT/BCIP substrate, the anti-digoxigenin antibody was removed 
by two 5 min washes in 0.1 M glycine/HCl, pH 2.2, 0.1% Tween 20 followed by three 
additional maleic acid buffer washes. A second round of pre-blocking, antibody 
incubation and post-antibody washes were as in the single probe protocol with the 
exception that anti-Fluo-AP antibody was used and the colour was developed using 
Fast Red tablets (Roche) according to the manufacturer’s instructions. Photographs 
demonstrating gene expression are representative of multiple individual specimens, 
with the following replicates. Oocytes and embryos: 3-4 small pieces of adult sponge, 
each containing tens to hundreds of oocytes or embryos of a given developmental 
stage; young syconoid sponges: at least 5 individual specimens; juveniles: small 
Petri dishes or wells of multi-well plates containing at least 10 juveniles. At least 
two independent experiments were carried out for each probe. 
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read de novo assembler. GigaScience 1, 18 (2012). 
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Extended Data Figure 1 | Phylogenetic tree of the ANTP-class 
homeodomains including representative bilaterian and non-bilaterian 
sequences. A neighbour-joining tree using the JTT +G (0.5) (1,000 bootstraps) 
model of protein evolution is displayed. A combination of three support 
values obtained for three phylogenetic methods is shown: black value is the 
bootstrap support from the neighbour-joining method, blue value is the 
bootstrap support from the maximum-likelihood method (LG+G 0.5), and red 
value is posterior probability from Bayesian analysis (LG+G 0.5). Bootstrap 


537 0.93 
{— MleANTPO3d 


support values below 10% and posterior probability values below 0.5 are not 
shown except for associations of calcisponge sequences. The root was 
determined by using selected PRD-class homeodomains as an outgroup. Aqu, 
A. queenslandica (Porifera/demosponges); Bfl, B. floridae (Chordata); Lco, 

L. complicata; Mle, M. leidyi (Ctenophora); Nve, N. vectensis (Cnidaria); Sci, 
S. ciliatum (Porifera/calcisponges); Tad, Trichoplax adhaerens (Placozoa); 
Tca, T. castaneum (Arthropoda). Scale bar indicates the number of amino acid 
substitutions per site. 
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Extended Data Figure 2 | Variability of the YIT/YIS homeodomain motif _rerio; Edi, Eleutheria dichotoma; Gga, Gallus gallus; Hsa, Homo sapiens; Lgi, 


within the Cdx/Cad, En and Dbx families in bilaterians, cnidarians, a Lottia gigantea; Mmu, Mus musculus; Nv, Nereis virens; Nve, N. vectensis; Pdu, 
placozoan and sponges. Ame, Apis mellifera; Aqu, A. queenslandica; Bfl, Platynereis dumerilii; Pfl, Ptychodera flava; Tad, T. adhaerens; Tca, 
B. floridae; Cte, Capitella teleta; Dme, Drosophila melanogaster; Dre, Danio T. castaneum; Xla, Xenopus laevis. 
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Extended Data Figure 3 | Phylogenetic tree including ANTP-class 
homeodomain families represented in sponges and two additional families 
characterized by the presence of the YIT motif but excluding divergent 

A. queenslandica sequences. Neighbour-joining (JTT, 1,000) bootstrap 
support values are in black, maximum-likelihood (LG+G 0.4, 1,000 replicates) 
bootstrap support values are in blue and Bayesian (LG+G 0.4) posterior 
probability values are in red. Only bootstrap support values equal to or above 
500 are shown. All families except Cdx are collapsed for clarity. Ame, 

A. mellifera; Aqu, A. queenslandica; Bfl, B. floridae; Cte, C. teleta; Dme, 

D. melanogaster; Dre, D. rerio; Edi, E. dichotoma; Gga, G. gallus; Hsa, H. sapiens; 
Lgi, L. gigantea; Mmu, M. musculus; Nv, N. virens; Nve, N. vectensis; Pdu, 

P. dumerilii; Pfl, P. flava; Tad, T. adhaerens; Tca, T. castaneum; Xla, X. laevis. 
Scale bar indicates the number of amino acid substitutions per site. 
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Extended Data Figure 4 | Phylogenetic tree induding ANTP-class 
homeodomain families represented in sponges and three additional families 
characterized by the presence of YIT/YIS motifs but excluding some 
divergent A. queenslandica sequences. Neighbour-joining (JTT, 1,000 
replicates) bootstrap support values are in black, maximum-likelihood (LG+G 
0.4, 1,000 replicates) bootstrap support values are in blue and Bayesian (LG+G 
0.4) posterior probability values are in red. Only bootstrap support values 
equal to or above 500 are shown. All subfamilies except Cdx are collapsed 
for clarity. Ame, A. mellifera; Aqu, A. queenslandica; Bil, B. floridae; Cte, 

C. teleta; Dme, D. melanogaster; Dre, D. rerio; Edi, E. dichotoma; Gga, G. gallus; 
Hsa, H. sapiens; Lgi, L. gigantea; Mmu, M. musculus; Nv, N. virens; Nve, 

N. vectensis; Pdu, P. dumerilii; Pfl, P. flava; Tad, T. adhaerens; Tca, 

T. castaneum; Xla, X. laevis. Scale bar indicates the number of amino acid 
substitutions per site. 
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Extended Data Figure 5 | Phylogenetic tree including ANTP-class 
homeodomain families represented in sponges and three additional families 
characterized by the presence of YIT/YIS motifs. Neighbour-joining 

(TT, 1,000 replicates) bootstrap support values are in black, maximum- 
likelihood (LG+G 0.4, 1,000 replicates) bootstrap support values are in blue 
and Bayesian (LG+G 0.4) posterior probability values are in red. Only 
bootstrap support values equal to or above 500 are shown. All families except 
Cdx are collapsed for clarity. Ame, A. mellifera; Aqu, A. queenslandica; Bil, 
B. floridae; Cte, C. teleta; Dme, D. melanogaster; Dre, D. rerio; Edi, E. dichotoma; 
Gga, G. gallus; Hsa, H. sapiens; Lgi, L. gigantea; Mmu, M. musculus; Nv, 

N. virens; Nve, N. vectensis; Pdu, P. dumerilii; Pfl, P. flava; Tad, T. adhaerens; 
Tca, T. castaneum; Xla, X. laevis. Scale bar indicates the number of amino acid 
substitutions per site. 
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Extended Data Figure 6 | S. ciliatum scaffolds containing NK genes and 
A. queenslandica scaffold containing cluster of NK genes. S. ciliatum NK 
genes are indicated in blue and A. queenslandica NK genes are indicated in 


{+ 
Okb 10 kb 20 kb 


green (modified with permission from ref. 6). Annotation of the neighbouring 
genes (genes within 50 kb from the NK gene) in S. ciliatum was performed 
using BLASTp searches against the RefSeq database. 
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Extended Data Figure 7 | Additional expression patterns of ANTP-class 
homeobox genes in embryonic development and during metamorphosis. 
a-g, All of the investigated genes (except Hex, data not shown) are expressed in 
oocytes. The expression of SciNKA is detectable in all blastomeres of the 
cleavage-stage embryos, but the transcripts are concentrated in the corner-most 
cytoplasm, which becomes gradually partitioned to the cross cells (arrows). 
1-q, This subcellular localization of cross-cell-enriched transcripts is also 


observed for SciNanos, expression of which, similarly to SciNKA, becomes 
ultimately restricted to cross cells and macromeres in pre-inversion-stage 
embryos. r-v, In metamorphosing post-larvae, SciNKA is expressed in the cells 
of the outer layer (r), SciNKB and SciNKG in (possibly non-overlapping) 
fractions of cells in the inner cell mass (s, t); SciHex is weakly expressed 
throughout the inner cell mass (u), and SciNKC (v) and NKD (data not shown) 
are not detectable in the juveniles. Scale bars: 10 j1m (a—q); 25 tm (r-v). 
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a Stage i b = Stage ii c Stage iii 
Flattened postlarva Flat metamorphosing Hollow metamorphosing 
shortly after settlement postlarva with monaxons postlarva 


d_ Stage iv 


Spherical juvenile 


e Stagev 
Cylindrical juvenile with 
crown of monaxons 


f Young syconoid 
Field collected specimens 
with radial chambers 
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Sample Replica Collected 
Stage i 4 29/06/12 
2 29/06/12 
Stage ii 1 09/07/12 
Stage iii 1 13/07/12 
2 13/07/12 
3 13/07/12 
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Extended Data Figure 8 | Samples used for quantification of expression. 
a-f, Metamorphosis in S. ciliatum, with stages based on those described 
previously” with modifications. Stage I, approximately 12h after settlement: 
large flat cells derived from larval macromeres envelop the inner cell mass 
composed of former micromeres (a). Stage II, approximately 24h after 
settlement: single-axis spicules (monaxons) are produced by sclerocytes, which 
have differentiated from the inner cell mass cells (b). Stage III, 2-3 days after 
settlement: choanocytes that have differentiated from the inner cells mass 
cells form a single internal chamber (c). Stage IV, approximately 4 days 

after settlement: osculum (exhalant opening) forms at the apical end of the 
spherical juvenile; first tri-radial spicules become apparent (d). Stage V, 
approximately 10 days after settlement: the juvenile is elongated along the 
apical-basal axis, long straight spicules form a crown around the osculum (e). 
Young syconoid sponges, approximately 8 weeks after settlement (f). 

a-e, Photographs of live specimens in culture; photographs a-d are top (apical) 
views, cartoon representations of sections and photograph e are side views. 
Scale bars: 100 1m (a-e); 1 mm (f). g, Details of replicates used for the analysis. 
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Several hundred juveniles were used in each sample. h, Plot demonstrating the 
results of principal component analysis of the metamorphosis series and axial 
dissection series of non-reproductive adults calculated according to the 
previously described method” and using information about the top 500 
differentially expressed genes as default parameters. Metamorphosis stages and 
parts of sponges are colour coded, with the ovals added manually for easier 
visualization of similarities and differences between the samples. Progress of 
development, starting from freshly released larvae and until the emergence 
of adult, but not yet reproductive sponges, is indicated by arrows. Note 
similarities of samples within replicates and with neighbouring stages of the 
metamorphosis series, and distinctiveness of the top (apical) samples from 
the basal and middle samples of the adults. i, Heat-map representation of 
sample-to-sample distances among all samples used in this study, calculated 
according to the previously described method” and based on expression of all 
coding genes in S. ciliatum (approximately 18,000 sequences). Note that 
replicates and neighbouring stages group together, as indicated by highlighting. 
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Extended Data Figure 9 | Heat-map representation of expression profiles. As demonstrated in Fig. 3b but with data from individual libraries presented 


separately. 
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Extended Data Table 1 | Quantification of differences in expression levels between top, middle and bottom parts of non-reproductive adult 


specimens of S. ciliatum 
Diff. expression Diff. expression 
Bons Expression level middle-top bottom-top 
Fold Adj. Fold Adj. 
Top Middle Bottom | change 
1 0 ; : 


change 


“Expression level’ was calculated as the sum of the posterior probability of each read coming from a given gene over all reads*® scaled using size factors of the libraries**; ‘fold change’ was calculated between 
expression levels in the middle (middle-top) or bottom (bottom-top) and top part of the sponge. adj. P value, P value adjusted for multiple testing with the Benjamini-Hochberg procedure”. diff., differential; Inf, 
positive infinity; NA, not applicable. 

*Values with statistical significance not less than 90% (adjusted P value < 0.1) and apical expression levels higher than in the middle or bottom part of the sponge. 
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Non-equivalent contributions of maternal and 
paternal genomes to early plant embryogenesis 


Gerardo Del Toro-De Leon", Marcelina Garcia-Aguilar' & C. Stewart Gillmor! 


Zygotic genome activation in metazoans typically occurs several 
hours to a day after fertilization, and thus maternal RNAs and pro- 
teins drive early animal embryo development’. In plants, despite sev- 
eral molecular studies of post-fertilization transcriptional activation, 
the timing of zygotic genome activation remains a matter of debate. 
For example, two recent reports that used different hybrid ecotype 
combinations for RNA sequence profiling of early Arabidopsis em- 
bryo transcriptomes came to divergent conclusions. One identified 
paternal contributions that varied by gene, but with overall maternal 
dominance’, while the other found that the maternal and paternal 
genomes are transcriptionally equivalent’. Here we assess paternal 
gene activation functionally in an isogenic background, by perform- 
ing a large-scale genetic analysis of 49 EMBRYO DEFECTIVE genes 
and testing the ability of wild-type paternal alleles to complement 
phenotypes conditioned by mutant maternal alleles. Our results dem- 
onstrate that wild-type paternal alleles for nine of these genes are com- 
pletely functional 2 days after pollination, with the remaining 40 genes 
showing partial activity beginning at 2, 3 or 5 days after pollination. 
Using our functional assay, we also demonstrate that different hybrid 
combinations exhibit significant variation in paternal allele activation, 
reconciling the apparently contradictory results of previous transcrip- 
tional studies*’. The variation in timing of gene function that we observe 
confirms that paternal genome activation does not occur in one early 
discrete step, provides large-scale functional evidence that maternal 
and paternal genomes make non-equivalent contributions to early 
plant embryogenesis, and uncovers an unexpectedly profound effect 
of hybrid genetic backgrounds on paternal gene activity. 

In animals, maternal RNAs and proteins drive early embryo develop- 
ment. The transition to zygotic control requires both clearing of maternal 
transcripts and zygotic genome activation, a phenomenon collectively 
known as the maternal-to-zygotic transition’. Research in plants has 
focused almost exclusively on zygotic genome activation, and has led to 
differing conclusions on its timing. Studies in maize and in tobacco sug- 
gested that large-scale zygotic transcription does not begin until 3-4 
days after fertilization*”. In agreement with this, an Arabidopsis RNA 
sequence profiling and reporter line study” and three smaller-scale mole- 
cular studies®* also came to the conclusion that many paternal alleles are 
either quiescent or transcribed at lower levels than corresponding mater- 
nal alleles in the first few days after fertilization. 

Other work has suggested a contrary view, namely that zygotic tran- 
scription begins immediately or soon after fertilization. Significant dif- 
ferences have been found between the transcript populations of egg cells 
and two-celled embryos in wheat’, and equivalent maternal and pater- 
nal expression for 25 different genes was observed in maize zygotes at 
1 day after pollination’®. In Arabidopsis, three genes have been shown by 
genetic means to be expressed from the paternal allele in early embryo- 
genesis'’"’, and a recent genome-wide transcriptional study found 
biparental transcription for nearly all of the ~8,000 genes that could be 
assayed at the one- to two-cell zygote stage’. Even taking into account that 
timing of zygotic genome activation may differ among plant species, it 
has been difficult to reconcile these conflicting results. However, when 


evaluating the two RNA profiling experiments arguing for’ or against” 
immediate zygotic genome activation in Arabidopsis, it is important to 
note that each study used a unique combination of polymorphic strains 
to distinguish maternal and paternal transcripts, such that the results 
are based on gene expression in embryos hybrid for different combina- 
tions of ecotypes. 

To study paternal allele activation in an isogenic background, we used 
a genetic assay with strict functional criteria: a test of the ability of wild- 
type (WT) paternal alleles to complement early embryo mutant pheno- 
types conditioned by mutant maternal alleles. While this assay is necessarily 
limited in the number of genes that can be tested (and in the corresponding 
conclusions that can be drawn), it presents two significant advantages 
over RNA sequencing approaches: by design, we only evaluate genes with 
a proven role in embryo development, leading to biologically relevant 
results. Second, we avoid the complex effects on quantitative trait-—such 
as gene expression values—that necessarily result from crossing poly- 
morphic, distantly related ecotypes (the basis for allele specific profil- 
ing by RNA sequencing)'*"””. 

We screened a collection of mutants in several hundred EMBRYO 
DEFECTIVE (EMB) genes required for early embryo development in 
Arabidopsis'*”’, and selected genes that were associated with clearly 
recognizable anatomical defects segregating in emb/+ plants at pre- 
globular stages (Supplementary Table 1 and Supplementary Data Set 1). 
For 49 different EMB genes, we then crossed heterozygous mutant plants 
with WT pollen, and scored the resulting embryos for mutant pheno- 
types at 2, 3, 5 and ~ 14 days after pollination (Fig. 1; complete data shown 
in Supplementary Table 2). We identified nine genes where the WT pater- 
nal allele complemented the mutant maternal allele as soon as there was a 
requirement for the gene product, resulting in no observed mutant phe- 
notypes at 2 days after pollination or later (AtSWI3A shown as an example 
in Fig. 1d-f, j). For the remaining 40 genes, the WT paternal allele gradual- 
ly complemented the mutant phenotype conferred by the mutant mater- 
nal allele (EMB2804 shown as an example in Fig. 1g-i, j). To determine 
more precisely the time point of the onset of paternal allele activation, 
we studied a subset of 13 genes in more detail (Supplementary Table 3). 
Comparison of mutant ratios resulting from hand-pollinated emb/+ X 
emb/+ and emb/+ X Col crosses identified genes whose activation began 
at either 2, 3 or 5 days after pollination. By contrast, Col X emb/+ control 
crosses resulted in no mutant phenotypes, showing that the delayed com- 
plementation we observed does not reflect haploinsufficiency expressed 
only in the first few days of embryogenesis, and that the maternal allele 
for these genes is absolutely required in early embryogenesis, while the 
paternal allele is not (Supplementary Table 3). 

We next sought to correlate function and expression, selecting the 
GCT and CCT genes as examples”. Figure 2a-g, i-o shows that mater- 
nally derived gGCT-GUS and pCCT::GUS reporter constructs drove 
embryo expression at 2 days after pollination, whereas paternally derived 
constructs drove embryo expression beginning at 3 days after pollina- 
tion, and reached full expression at 4 days after pollination. Phenotypic 
complementation of gct and cct mutant maternal alleles by the WT 
paternal allele began at 5 days after pollination (Fig. 2h, p), 1 day after 


1Laboratorio Nacional de Genémica para la Biodiversidad (Langebio), Unidad de Gendémica Avanzada, Centro de Investigacion y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), 


lrapuato, Guanajuato 36821, México. 
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Col WT x Col WT 


atswi3a/+ x Col WT 


pa 
Sof /ME: 


emb2804/+ x Col WT 


Figure 1 | Paternal allele activation in early Arabidopsis embryogenesis 
varies between genes. a—c, Normal phenotypes resulting from hand self- 
pollinated WT plants. DAP, days after pollination. d-f, Normal phenotypes 
resulting from atswi3a/+ X Col WT. For comparison purposes, the inset in f 
shows the phenotype ofa heart stage atswi3a/— embryo froma selfed atswi3a/+ 
plant. g-i, Mutant (Mut) phenotypes resulting from emb2804/+ < Col WT. In 
a-i, the percentage of embryos with WT or mutant phenotypes at 2, 3 and 

5 days after pollination is shown, with the total number of embryos examined 
in parentheses. In crosses, the maternal genotype is listed first. Embryos shown 
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Figure 2 | Delayed molecular and functional activity for GCT and CCT 
paternal alleles in early embryogenesis. a-g, Expression of paternally and 
maternally derived gGCT-GUS at 2, 3, 4 and 5 days (d) after pollination. 

h, Quantification of phenotypically mutant embryos derived from hand- 
pollinated selfed gct/+ plants, or crosses with WT pollen (1 > 90 for all time 
points). i-o, Expression of paternally and maternally derived pCCT::GUS at 2, 
3, 4 and 5 days after pollination. p, Quantification of phenotypically mutant 
embryos derived from hand-pollinated selfed cct/+ plants, or from crosses with 
WT pollen. For all crosses, the maternal genotype is listed first. Embryos in 
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at equal magnification; scale bar, 10 jum. j, Heat map summarizing the 
percentage of mutant phenotypes observed at 2, 3, 5 and ~14 days after 
pollination in embryos resulting from emb/+ X Col WT crosses, for 49 genes 
tested. Full paternal allele activity corresponds to 0% observed mutants, 
while complete lack of paternal allele activity corresponds to 50% observed 
mutants. Genes are clustered based on the similarity of their complementation 
profiles. For each time point, the combined number of embryos examined in 
at least two biological replicates is listed in Supplementary Table 2. See 
Methods for additional detail on crosses. 
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a, d, iand | are indicated with arrowheads. Embryos in f and n indicated with a 
dotted line. Scale bar, 20 um. Results of a two-tailed Fisher’s exact test, 
performed to determine whether the two samples were statistically different at 
each time point, are shown above the columns in g, h, o and p: ***P < 0.001; 
**P < 0.01; NS, no significant difference between the two samples. For each 
time point, the combined number of embryos examined in at least two 
biological replicates, as well as exact P values for results of two-tailed Fisher’s 
exact test, are listed in Supplementary Table 4. See Methods for additional 
details on crosses. 
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the corresponding B-glucuronidase (GUS) reporter lines reached full 
expression (Fig. 2g, o and Supplementary Table 4). Thus, molecular 
and functional data indicate that the WT paternal alleles for GCT and 
CCT show a gradual increase in activity in early embryogenesis. 

Our data provide clear genetic evidence that most paternal alleles 
investigated in this study show gradual activation during the first few 
days of embryogenesis, whereas others are fully functional by 2 days 
after pollination. These findings are entirely consistent with studies of 
reporter gene expression in early isogenic embryos that found a gradual 
increase in transcription of most paternal alleles”®’, as well as a smaller 
number of genes with equal maternal and paternal expression*"'. Our 
results are also generally consistent with the overall conclusions derived 
from transcriptome profiling of embryos hybrid between the Landsberg 
erecta (Ler) and Columbia (Col) ecotypes, which predicted 65% of genes 
to have some paternal transcription at the two- to four-cell stage (with 
overall maternal bias)’. However, our findings contrast with the equal 
maternal and paternal transcription discovered in early embryos hybrid 
between the Col and Cape Verde Islands (Cvi) ecotypes’. If early, equi- 
valent zygotic transcription were the case in isogenic Col, we would expect 
to find no phenotypically mutant embryos resulting from emb/+ X Col 
crosses for all of the 49 EMB genes we tested. In fact we found mutant 
phenotypes segregating for 40 genes, 19 of which were absent in egg and 
29 in sperm cell transcriptomes”’, demonstrating that loss of maternal 
contribution from the egg is not an overall explanation for lack of imme- 
diate complementation in emb/+ x Col crosses (Supplementary Table 2). 
Forty-four of the 49 genes in our assay are also expressed in early endo- 
sperm (2-5 days after pollination)” (Supplementary Table 5), although 
none of these genes show imprinted expression”’, and no allele-specific 
expression data exist, precluding any conclusion on the possibility of 
endosperm-embryo messenger RNA (mRNA) mobility having an effect 
on our complementation experiments™. 

On a gene-by-gene basis, comparison of our functional data for iso- 
genic Col with transcriptional results for Ler/Col’ and Cvi/Col’ hybrids 
revealed numerous discrepancies (Supplementary Table 6). One simple 
explanation for this divergence is that gene activity is indeed different 
between isogenic and hybrid embryos, and varies between hybrid com- 
binations, as previously seen when comparing hybrid seedlings with 
their parental lines'*””. To test this hypothesis, we crossed 11 different 
emb/+ lines (representing varying degrees of delayed paternal activity) 
with pollen from Ler, Cvi, the common laboratory strain Wassilewskija 
(Ws) and Tsu (an ecotype distantly related to Col)**, and compared the 
resulting segregation ratios with those for emb/+ X Col (Fig. 3a). For all 
EMB genes, we found significant differences between isogenic Col em- 
bryos and at least one hybrid combination, and for six genes we found 
significant differences between isogenic Col and three or more hybrids; 
in almost all instances, hybridization caused earlier gene activation com- 
pared with isogenic Col (Supplementary Table 3). In general, crosses 
with Cvi showed the most differences with isogenic Col, crosses with Tsu 
were least different, and crosses with Ler and Ws were intermediate. 
Importantly, directly comparing Col/Ler and Col/Cvihybrids resulted in 
significant differences for 6 out of 11 genes. These trends were especially 
clear after principal component analysis of our data, in which PC1 
separates Col, Tsu and Ws from Ler and Cvi, and PC2 further separates 
Ler and Cvi from Col, and from each other (Fig. 3b). Thus, significant 
variation in early paternal allele activity exists between isogenic Col 
embryos and embryos hybrid for Col and a second ecotype. In particular, 
direct comparison of Col/Ler and Col/Cvi hybrids revealed large differ- 
ences, consistent with transcriptional profiling’. 

In summary, our data conclusively demonstrate that discrete, gen- 
ome-wide zygotic activation does not occur in isogenic Columbia, the 
most commonly used ecotype of Arabidopsis. Instead, we show that 
gene activation varies, and individual contributions must be determined 
on a case-by-case basis. We have also uncovered an unexpectedly large 
effect of hybridization on gene activation in early embryogenesis, pro- 
viding a simple explanation for different conclusions regarding zygotic 
genome activation derived from transcriptional profiling of different 
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Figure 3 | Gene activation kinetics differ between isogenic and hybrid 
embryos. a, Heat map representations of mutant phenotype percentages 
observed for 11 different emb/+ mutants for self crosses (X emb/+), and 
crosses to Col-0 (X Col), Tsu-1 (X Tsu), Ws (X Ws), Ler (X Ler) and Cvi-0 
(X Cvi). b, Principal component analysis of mutant phenotype segregation 
ratios observed in emb/+ crosses to all WT ecotypes. For each time point, the 
combined number of embryos examined in at least two biological replicates is 
listed in Supplementary Table 3. See Methods for additional details on crosses. 
All emasculations and pollinations were done by hand. The emb mutants 
were in the Col ecotype. See Supplementary Table 3 for complete data and 

P values for the results of a two-tailed Fisher’s exact test, performed to 
determine whether two samples were statistically different at each time point. 


hybrid combinations”’. The varying effects on gene activation resulting 
from different combinations of inbred strains of Arabidopsis may be 
instructive for future research on the molecular basis of heterosis in seed 
crops. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Plant growth conditions. Plants were grown in growth chambers at 22 + 2 °C in 
16/8-h light/dark cycles at constant humidity rate and 100 pmol m” fluence. Plants 
were grown in a mixture of vermiculite (GRACE MAN-FIN), perlite (AGROL125) 
and sunshine mix (PREMEZ FWSS3) (1:1:3 v/v/v). Seeds were sown directly on the 
soil, and given a 3-day cold treatment to promote germination, before being moved 
to growth chambers. 

Crosses, histology and microscopy. Heterozygous plants were identified based on 
segregation of emb phenotypes. Crosses were performed using an emb/+ plant as 
female parent and WT plants as the male parent. Heterozygous plants were emascu- 
lated, and pollinated 2 days later. The same ecotype was always used for both parents; 
with the exception of yda-1 (Ler ecotype), all mutants were in the Columbia ecotype. 
The seeds were cleared with a modified Hoyer’s solution consisting of 7.5 g gum 
arabic, 100 g chloral hydrate, 5 ml glycerol and 30 ml water, diluted 1:1 with a dilution 
solution consisting of 7.5 g gum arabic, 5 ml glycerol and 30 ml water. Siliques were 
collected at 2, 3 and 5 days after pollination, and the ovules were cleared and observed 
as mentioned above. Embryos were observed on a Leica DM6000 B Nomarski micro- 
scope. For the time point at ~ 14 days after pollination, seed abortion was quantified 
macroscopically with a dissecting microscope. For each cross, we performed at least 
two biological replicates, meaning crosses were performed with at least two different 
emb/-+ plants as mothers. Our goal was to score at least 100 embryos for each genotype 
for each time point, although owing to occasional variability in seed set, for a small 
number of time points we scored fewer than 100 embryos. Genotypes for which there 
are large numbers of embryos scored (for example, X Col reported in Fig. 3 and Sup- 
plementary Tables 2 and 3) are due to performing control crosses for multiple hybrid 
crosses. 

GUS staining analysis. pCCT::GUS and gGCT-GUS plants were emasculated, and 
pollinated 2 days later using WT Col pollen. The reciprocal crosses were conducted 
the same way. Developing siliques at 2, 3, 4 and 5 days after pollination were dissected 
to expose the ovules. Ovules were then punctured with forceps and incubated in 
permissive GUS staining buffer (0.2% Triton X-100, 2 mM potassium ferrocyanide, 
2mM potassium ferricyanide, 2 mM X-Gluc in N,N-dimethylformamide (DMF) in 
50 mM sodium phosphate buffer pH 7.2) overnight in a humid chamber at 37 °C. The 
tissue was cleared in 50% glycerol and observed on a Leica DM6000 B microscope 
under Nomarski optics. 

Construction of GCT and CCT GUS reporters. A pCAMBIA 3301 plasmid was 
modified by replacing the GUS gene with the GUS* gene from pCAMBIA 1305.1, 
producing the 3301 GUS* vector. The gGCT-GUS translational fusion of GUS to the 


GCT protein was constructed by cloning the 900 base pair GCT 5’ upstream region 
(from the GCT translational start up to the previous gene) into the 3301 GUS™ vector, 
in front of GUS". Then, the entire GCT genomic region, as well as the 3’ untranslated 
region corresponding to the At1g55325.2 transcript, was cloned in-frame downstream 
of GUS*, with a NAAIRS linker in between GUS* and the GCT protein. Thus, this 
construct contained all immediate upstream, downstream and intron elements of the 
GCT gene. The gGCT-GUS construct was transformed into the Col ecotype, and more 
than 100 T1 plants expressing GUS were examined before choosing a represent- 
ative line to use for detailed characterization. The expression pattern conferred by 
this construct exactly matches the native mRNA expression pattern observed for 
GCT during embryogenesis”. 

The 3301 GUS" vector was modified by replacing GUS* with a GUS" that included 

the nuclear localization signal LQPKKKRKV from ref. 26, placed at the carboxy 
terminus of GUS", to produce the 3301 GUS* NLS vector. The pCCT::GUS tran- 
scriptional reporter was constructed by cloning the 4.9 kilobase CCT upstream region 
(from the CCT translational start up to the previous gene) into the 3301 GUS* NLS 
vector, in front of GUS* NLS. Then, the CCT3’ untranslated region corresponding 
to the At4g00450.1 transcript was cloned after the GUS* NLS gene. Thus, this CCT 
transcriptional reporter contained all immediate 5’ and 3’ upstream and downstream 
regions corresponding to the CCT gene. The pCCT::GUS construct was transformed 
into the Col ecotype, and more than 100 GUS-positive T1 plants were examined 
before choosing a representative line to use for detailed characterization. The expres- 
sion pattern conferred by this construct exactly matched the native mRNA expres- 
sion pattern observed for CCT during embryogenesis”. 
Statistical analysis. To determine whether mutant phenotype segregation ratios 
differed significantly from 25% for selfed emb/+ data (as presented in Supplementary 
Table 1), the Microsoft Excel CHITEST function was used. The two-tailed Fisher’s 
exact test (www.langsrud.com/fisher.htm) was used to determine statistically signifi- 
cant differences for data in Supplementary Tables 2-4. Heat maps in Figs 1 and 3 were 
made in R with the heatmap.2 function of the data visualization package gplots 2.11.0; 
hierarchical clustering of rows was performed using the Ward method. Principal com- 
ponent analysis used the R package with the prcomp function. To illustrate the rela- 
tionships among the variables (ecotypes) quantitatively, standard deviation percentages 
for each principal component were calculated as shown in Fig. 3b. 


26. Matsushita, T., Mochizuki, M. & Nagatani, A. Dimers of the N-terminal 
domain of phytochrome B are functional in the nucleus. Nature 424, 571-574 
(2003). 
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Oncogene ablation-resistant pancreatic cancer cells 
depend on mitochondrial function 


Andrea Viale'**, Piergiorgio Pettazzoni*, Costas A. Lyssiotis®, Haoqiang Ying’, Nora Sanchez”, Matteo Marchesini‘, 
Alessandro Carugo’**, Tessa Green, Sahil Seth®, Virginia Giuliani’, Maria Kost-Alimova’, Florian Muller’, Simona Colla’, 
Luigi Nezi>?, Giannicola Genovese’, Angela K. Deem!, Avnish Kapoor!, Wantong Yao”, Emanuela Brunetto®, Ya’an Kang’, 
Min Yuan®, John M. Asara®, Y. Alan Wang’, Timothy P. Heffernan”, Alec C. Kimmelman’, Huamin Wang”, Jason B. Fleming’, 


Lewis C. Cantley’, Ronald A. DePinho" & Giulio F. Draetta’? 


Pancreatic ductal adenocarcinoma (PDAC) is one of the deadliest can- 
cers in western countries, with a median survival of 6 months and an 
extremely low percentage of long-term surviving patients. KRAS mu- 
tations are known to bea driver event of PDAC’, but targeting mutant 
KRAShas proved challenging’. Targeting oncogene-driven signalling 
pathways is a clinically validated approach for several devastating 
diseases**. Still, despite marked tumour shrinkage, the frequency of 
relapse indicates that a fraction of tumour cells survives shut down 
of oncogenic signalling**. Here we explore the role of mutant KRAS 
in PDAC maintenance using a recently developed inducible mouse 
model of mutated Kras' (Kras@!”, herein KRas) ina ps3” WT back- 
ground. We demonstrate that a subpopulation of dormant tumour 
cells surviving oncogene ablation (surviving cells) and responsible for 
tumour relapse has features of cancer stem cells and relies on oxid- 
ative phosphorylation for survival. Transcriptomic and metabolic 
analyses of surviving cells reveal prominent expression of genes gov- 
erning mitochondrial function, autophagy and lysosome activity, 
as well as a strong reliance on mitochondrial respiration and a de- 
creased dependence on glycolysis for cellular energetics. According- 
ly, surviving cells show high sensitivity to oxidative phosphorylation 
inhibitors, which can inhibit tumour recurrence. Our integrated ana- 
lyses illuminate a therapeutic strategy of combined targeting of the 
KRAS pathway and mitochondrial respiration to manage pancre- 
atic cancer. 

Using our tetracycline-inducible KRas allele’, we controlled the expres- 
sion of KRas in a temporal and pancreas-specific manner. Upon doxycy- 
cline withdrawal, we observed regression of pancreatic tumours within 
2-3 weeks followed by relapse after 4-5 months, suggesting that a frac- 
tion of tumour cells survived oncogene ablation. To investigate the impact 
of KRas ablation in detail, we transplanted cells from primary tumours 
subcutaneously into recipient mice fed with doxycycline. When tumours 
reached a diameter of 1 cm, doxycycline was withdrawn and the lesions 
rapidly and apparently completely regressed (Fig. 1a and Extended Data 
Fig. 1a). However, analysis of residual scars detected epithelial remnants 
embedded in fibrotic tissue (Fig. 1b and Extended Data Fig. 1b, c). This 
phenotype was confirmed in vitro using a three-dimensional culture sys- 
tem in which cells from primary lesions were grown as spheres in semi- 
solid medium. After doxycycline withdrawal (Extended Data Fig. 1d, e), 
tumour spheres underwent regression due to apoptosis (Extended Data 
Fig. 1f), and onlya small population of dormant cells survived (Extended 
Data Fig. 1d, g). Notably, upon KRas re-activation, surviving cells (SCs) 
massively re-entered the cell cycle both in vitro and in vivo (Fig. 1c and 
Extended Data Fig. 1g, h) and rapidly reconstituted spheres and tumours, 


suggesting that subpopulations of cells differently addicted to KRas co- 
exist in pancreatic tumours. 

To assess the tumorigenic potential of SCs, we isolated KRas-expressing 
cells and SCs from tumour spheres (ex vitro) and tumours (ex vivo) and 
transplanted them in limiting dilution into recipient mice. Surprisingly, 
SCs were highly enriched in tumour-initiating cells (TICs). Virtually all 
SCs ex vitro initiated tumours in mice (TIC frequency >>1:5 versus 1:31 
in KRas-expressing cells; P< 0.001) (Fig. 1d and Extended Data Fig. 2a), 
and TIC frequency was similarly enriched in SCs ex vivo (1:10 versus 1:100 
in KRas-expressing cells) (Fig. 1d and Extended Data Fig. 2b). Then, to 
assess whether pharmacological ablation of oncogenic pathways could 
mimic the genetic suppression of KRas, we treated tumour spheres de- 
rived from a KRas constitutive mouse model’ with a combination of 
Mek1 (AZD8330) and dual PI3K/mTOR (BEZ235) inhibitors (Extended 
Data Fig. 2c). The treatment resulted in an enrichment of tumorigenic 
cells (TIC frequency 1:7 versus 1:47 for treated versus non-treated cells, 
respectively) (Fig. 1d and Extended Data Fig. 2d). Collectively, our data 
demonstrate that PDAC tumours are heterogeneous, and that a popu- 
lation of spherogenic and tumorigenic cells survives genetic and phar- 
macological ablation of oncogenic pathways. 

To exclude the possibility that SCs represent a more aggressive sub- 
clone of tumour cells, we performed exome sequencing of tumour cells 
during cycles of KRas activation-inactivation-reactivation (ON-OFF- 
ON cycles) and evaluated changes in the allelic frequency of single nucle- 
otide variants (SNVs), a hallmark of clonal selection. Mutational profiles 
did not show any significant modification in allelic frequencies before 
versus after ON-OFF-ON cycles (Fig. le and Extended Data Fig. 2e), 
demonstrating that tumours after KRas reactivation are genetically 
identical to their primary counterparts. While these data formally exclude 
genetic clonal selection among SCs, epigenetically driven clonal selec- 
tion of a more aggressive subclone remains possible. 

To characterize SCs further, we examined the expression of markers 
used to isolate cancer stem cells in human tumours*””°. We found that 
different subpopulations of tumour cells were differentially sensitive to 
KRas ablation; specifically, only CD133 * cp44hiz” cells avoided under- 
going massive apoptosis (Fig. 1fand Extended Data Fig. 1i). Consequently, 
tumour remnants in vivo are strongly positive for stem cell markers 
(Fig. 1g, h and Extended Data Fig. 2f, g). Together, the tumorigenicity 
and immunophenotypic similarity between SCs and previously identi- 
fied human pancreatic cancer stem cells*”° suggests that SCs may pos- 
sess cancer stem cell characteristics. 

We next performed a transcriptomic analysis of cells isolated from tu- 
mour spheres. Gene set enrichment analysis (GSEA) revealed significant 
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Figure 1 | Cells surviving oncogene ablation are enriched in tumorigenic 
cells. a, Tumour volume before/after KRas ablation (+/—KRas) (n = 6). 

b, Histology depicting tumour remnants (X10). c, Immunofluorescence of 
KRas-expressing tumour (+KRas), regressed tumour (—KRas) and regressed 
tumours 48 h after KRas re-activation (—KRas re-ON) for Ki67 (red), CD44 
(green) and 4',6-diamidino-2-phenylindole (DAPI; blue) (40). d, Limiting 
dilution transplantation, TIC frequency. Genetic model: +KRas (black) versus 
-KRas (grey) ex vitro (n = 4) or ex vivo (n = 2). Pharmacological 
downregulation: control (black) versus treated spheres (grey; AZD8330 plus 
BEZ235 (AZD+BEZ)) (n = 2). Ctrl, control. e, Exome sequencing: allele 


enrichment of genes involved in several metabolic pathways (for exam- 
ple, mitochondrial electron transport chain (ETC), lysosome activ- 
ity, autophagy, mitochondrial and peroxisomal B-oxidation) (Fig. li 
and Extended Data Fig. 3a—e), which suggested that SCs might have 
increased mitochondrial activity. Indeed, Ppargcla (PGCla),a key reg- 
ulator of mitochondrial biogenesis"’, was increased at the messenger 
RNA and protein levels in SCs (Fig. 2a and Extended Data Fig. 4a), and 
we detected PGCla accumulation in the nuclei of SCs in vivo (Fig. 2c). 
Furthermore, SCs from tumour spheres stained intensely for MitoTracker 
Green, a marker of mitochondrial mass (Extended Data Fig. 4b). These 
data were corroborated by increased expression of the mitochondrial 
marker VDAC1 in SCs in vitro and in vivo (Fig. 2b, d). 

We functionally validated our findings by measuring respiratory capac- 
ity. SCs had a fourfold increase in oxygen consumption rate (OCR) com- 
pared with KRas-expressing cells (118 versus 33 pmol min“ ', respectively; 
P= 0.001) (Fig. 2e). Likewise, both in vivo and in vitro, mitochondria 
of SCs that were either genetically or pharmacologically selected were 
more hyperpolarized (Fig. 2f and Extended Data Fig. 4c—f) and gener- 
ated more reactive oxygen species (ROS) (Extended Data Fig. 4g, h), indi- 
cating a more active ETC. We also detected morphological differences in 
mitochondria using transmission electron microscopy (TEM) (Fig. 2). 
Because transmembrane mitochondrial potential regulates the mitochon- 
drial permeability transition pore (lower potential = lower threshold for 
pore opening) and because cells positive for stem cell markers in KRas- 
expressing tumours have hyperpolarized mitochondria (Extended Data 
Fig. 4i, j), the higher mitochondrial potential per se could explain why cells 
positive for stem cell markers are less prone to KRas ablation-mediated 


frequencies after KRas re-activation in SCs (Re-ON) versus KRas-expressing 
cells (Reference) at 40,383 and 44,182 SNVs for two independent tumours. 
f, Annexin V in spheres +/—KRas with respect to CD44/CD133 expression. 
Percentage of live, apoptotic (Apo) and necrotic (Nec) cells are reported 

(n = 3). g, Immunohistochemistry of —KRas tumours for CD44 (blue) and 
CD133 (red) (X20-40). h, Immunophenotyping of +/—KRas tumours 

for CD44/CD133/aldefluor. i, GSEA of pathways enriched in —KRas versus 
+KRas cells. Normalized enriched scores (NES) are reported. Data are 
mean = standard deviation (s.d.). 


apoptosis. Altogether, our data suggest that altered metabolic and mito- 
chondrial functions are critical features of SCs. 

SCs and KRas-expressing cells were next exposed to mitochondrial 
stressors. Treatment with oligomycin, a Fo-ATPase inhibitor of Complex 
V, significantly reduced mitochondrial respiration in both populations 
(Extended Data Fig. 5a). Conversely, normalization to basal respira- 
tion revealed different effects of the uncoupler carbonyl cyanide-p- 
trifluoromethoxyphenylhydrazone (FCCP) (Extended Data Fig. 5b), 
indicating that the mitochondria of SCs function near their maximal 
rate and have minimal spare respiratory capacity. Despite similar over- 
all responses to Complex V inhibition, ATP levels of SCs were dramat- 
ically reduced upon oligomycin treatment compared to KRas-expressing 
cells (Fig. 2h), suggesting a deficit in energy compensatory mechanisms. 
Indeed, KRas-expressing cells exposed to oligomycin strongly upregu- 
lated their extracellular acidification rate (ECAR) and lactate produc- 
tion, a compensation that did not occur in SCs (Fig. 2i and Extended 
Data Fig. 5c, e), confirming that SCs failed to increase glycolysis after 
oxidative phosphorylation (OXPHOS) inhibition in a manner sufficient 
to rescue the defects in ATP production. 

To assess comprehensively differences in metabolism, we performed 
a metabolomic analysis using a liquid chromatography coupled with 
tandem mass spectrometry (LC-MS/MS)-based platform”, revealing 
that several metabolic pathways were deregulated in SCs (Extended 
Data Fig. 5d). Consistent with the above findings, glycolytic intermedi- 
ates downstream of phospho-fructose kinase (PFK) were significantly 
less abundant in SCs versus KRas-expressing cells (Fig. 2j). The impaired 
glycolysis of SCs was independently validated by measuring glucose 
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uptake and lactate production in vitro (Fig. 2k, land Extended Data Fig. 6a) 
and in vivo (Extended Data Fig. 6b). The ratio between lactate and glu- 
cose in spent media supports this difference even more strongly (con- 
centration of lactate/concentration of glucose: KRas-expressing cells, 
16.9; SCs, 0.9). Importantly, SCs surviving pharmacological ablation 
of KRas showed a similar phenotype (Extended Data Fig. 6c, d). 

Wealso detected a decreased abundance of metabolic intermediates 
specific to the tricarboxylic acid (TCA) cycle in SCs (Extended Data 
Fig. 5f). We used carbon-13-labelled primary metabolic substrates to 
trace their contribution to central carbon metabolism. After 36 h label- 
ling, SCs relied less on glucose and glutamine and more on pyruvate 
and palmitate to generate TCA cycle intermediates and branching metab- 
olites (Fig. 2m and Extended Data Fig. 7e-h). This is consistent with 
reports describing the mutant KRas-mediated activation of anabolic 
glucose and glutamine metabolism in PDAC""*""*. It is also worth not- 
ing that SCs had lower levels of high-energy metabolites (Extended 
Data Fig. 5g) (compatible with less anabolic, dormant cells) and increased 
total glutathione (Extended Data Figs 5g and 8h). Importantly, any effect 
of cell cycle fluctuations on metabolic processes was excluded by our 
experimental design, as comparisons were made between quiescent SCs 
and quiescent fully formed KRas-expressing spheres (Extended Data 
Fig. 1g). In fact, we demonstrated that sphere formation is a dynamic and 
regulated process in which tumour cells expressing KRas exit the cell 
cycle when tumour spheres are fully formed. Thus, observed metabolic 
alterations can appropriately be attributed to an autonomous meta- 
bolic program. 

The lower energy levels and a lack of energetic compensatory mech- 
anisms in response to mitochondrial stressors in SCs suggested that 
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Figure 2 | Surviving cells have more active 
mitochondria and impaired glycolysis. 

a, b, Immunoblot of +/—KRas cells derived 

from independent tumours probed with PGCla 
(a) and VDAC1 (b) antibodies. Numbers indicate 
different independent tumours. c, d, In vivo 
immunofluorescence for CD44 (green) and PGCla 
(red) (c), and CD44 and VDAC1 (red) (d) in 
+/—KRas tumours (X60). e, Oxygen consumption 
of +/—KRas cells (n = 7). f, In vivo mitochondrial 
potential of +/—KRas tumours (n = 3); 
representative flow cytometry of two tumours, as 
control uncoupler CCCP was added to acquired 
—KRas. g, TEM micrographs representing 
mitochondrial morphology (< 25,000). h, ATP 
levels of +/—KRas cells in response to oligomycin 
(Oli) or dimethylsulphoxide (DMSO; Ctrl) (n = 4). 
i, ECAR response of +/—KRas cells to glucose, 
oligomycin and 2-deoxy-D-glucose (2DG) (n = 4). 
j, Fold change of glycolytic intermediates in 
+/—KRas cells (n = 4). Ala, alanine; BPG, 
bisphsphoglycerate; DHAP, dihydroxyacetone 
phosphate; F6P, fructose-6-phosphate; F(1,6)BP, 
fructose-1,6-bisphosphate; G6P, glucose-6- 
phosphate; G1P, glucose-1-phosphate; 

Ga3P, glyceraldehyde- 3-phosphate; G3P, 
glycerol-3-phosphate; Lac, lactate; PEP, 
phosphoenolpyruvate; 3PG, 3-phosphoglycerate; 
Pyr, pyruvate. k, 1, Glucose uptake (k) and lactate 
production (I) of +/—KRas cells (n = 3). 

m, Isotopomer distribution for «-ketoglutarate 
(aKG) following steady-state tracing with 
uniformly carbon-13-labelled substrates (n = 3). 
Data are mean + s.d. 


CD44 VDAC1 


aKG M+0 
n aKG M+1 
maKkG M+2 
makG M+3 
makG M+4 
maKkG M+5 


*P < 0.05 
**P < 0.01 


2 
© 
s 
2 
S 
a 
+ 


1 Pyruvate 


treatment with OXPHOS inhibitors might impact their survival. As ex- 
pected, even a short, 24 h exposure to oligomycin completely abrogated 
the ability of SCs to reform tumour spheres upon KRas re-expression, 
whereas KRas-expressing cells were unaffected (Fig. 3a and Extended 
Data Fig. 8a). Similar effects were observed with other OXPHOS inhi- 
bitors, although with less dramatic effects relative to Complex V inhibi- 
tors (Extended Data Fig. 8b). To determine whether OXPHOS inhibition 
synergized with KRas ablation in vivo, we induced tumour regression 
for 2 weeks via doxycycline withdrawal, treated mice with oligomycin 
(0.5 mgkg ' daily, intraperitoneal (i-p.)) or vehicle, and subsequently 
re-induced KRas and tumour relapse (Extended Data Fig. 8c—e). While 
mice treated with vehicle relapsed immediately (median survival 15 days, 
maximal survival 19 days), 25% of oligomycin-treated mice survived 
>60 days (P < 0.0001) (Fig. 3b and Extended Data Fig. 8f). Oligomycin 
treatment also strongly abrogated the spherogenic potential of SCs treated 
with AZD8330 plus BEZ235 independently of their p53 status (Fig. 3c). 
Because OXPHOS inhibitors induce ROS"’ and SCs have elevated basal 
mitochondrial ROS levels (Extended Data Fig. 4g and Extended Data 
Fig. 8g), we excluded the possibility that the observed synthetic lethality 
of oligomycin was caused by oxidative stress (Extended Data Fig. 8h-k). 

To explore the therapeutic potential of inhibiting OXPHOS in human 
tumours, we derived spheres from early passage patient-derived xenografts’. 
Similar to the mouse model, combination treatment with AZD8330 and 
BEZ235 for 1 week strongly decreased sphere formation (Extended Data 
Fig. 9a—c), sparing cells endowed with spherogenic potential, increased 
mitochondrial potential (Extended Data Fig. 9d) and, most importantly, 
unable to upregulate ECAR in response to OXPHOS inhibition (Fig. 3d). 
As with mouse cells, short exposure of human SCs to oligomycin 
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dramatically decreased their spherogenic potential (Fig. 3e). Similarly, 
transient genetic downregulation of TFAM (a key regulator of mito- 
chondrial transcription’’) and TUFM (a GTPase that delivers amino- 
acylated transfer RNAs to mitochondrial ribosomes”’) decreased the 
spherogenic potential of human cells surviving combined AZD8330 
plus BEZ235 treatment (Extended Data Fig. 9e-h). Altogether, these 
data highlight mitochondrial respiration as an attractive, druggable 
target that may effectively eradicate SCs in PDAC. 

The low energy state of SCs suggested that these cells may be experi- 
encing metabolic stress. We thus measured AMPK phosphorylation at 


Thr 172, an established indicator of metabolic stress. Surprisingly, AMPK 
phosphorylation was lower in SCs versus KRas-expressing cells (Fig. 4a); 
however, marked induction of AMPK phosphorylation was observed 
specifically in SCs upon oligomicyn treatment, stressing again the impor- 
tance of ETC activity for their energy production (Fig. 4b and Extended 
Data Fig. 9i). 

To elucidate further the metabolic mechanisms active in SCs, we inves- 
tigated other targets identified by transcriptomic analysis, including auto- 
phagy and B-oxidation. Autophagy and lysosome activity, two of the most 
activated pathways on the basis of GSEA (Fig. li), have an important 
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Figure 4 | Surviving cells are not in metabolic stress and activate autophagy. 
a, Immunoblot of Thr-172-phosphorylated and total AMPK, Ser-79- 
phospho-acetyl-coA-carboxylase (pACC) and f-actin in +/—KRas cells for 
two independent tumours. b, Immunoblot of —KRas cells treated or not with 
oligomycin (Oli) probed with anti-Thr-172-phospho-AMPK and -actin 
antibodies for three independent tumours. c, Immunoblot of LC3 and f-actin 
in +/—KRas cells for two independent tumours. d, Autophagic flux of 
+/—KRas cells stably expressing GFP-LC3. Bafilomycin treatment rescues 
GFP (—KRas+ Baf) (n = 3); representative flow cytometry of two tumours. 


+KRas —KRas 


e, Lipid droplet quantification in +/—KRas cells (n = 4); representative flow 
cytometry of two tumours. f, Fusion between lipid droplets (red arrowheads) 
and autophagosomes (green arrowheads) in —KRas cells (TEM; 25,000). 

g, Confocal microscopy for lipid droplets (Lipidtox; red) and lysosomes 
(Lysotracker; green) in — KRas spheres. Hoechst, blue. h, Oxygen consumption 
of +/—Kras cells pre-treated with bafilomycin (Baf), etomoxir (Eto) or 
vehicle (Ctrl) (n = 4). i, Effect of bafilomycin and etomoxir on spherogenic 
potential of +/—Kras cells (n = 6). Data are mean = s.d. 
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role in the metabolism of SCs. The autophagic marker microtubule- 
associated protein light-chain 3 (LC3) was abundantly expressed in SCs 
(Fig. 4c), and lysosomes and autophagosomes at various stages of evo- 
lution were abundant (Extended Data Fig. 10a). To investigate the acti- 
vation of autophagic flux, we transduced KRas-expressing tumour spheres 
with LC3-green fluorescent protein (GFP) fusion protein. Cells stably 
expressing LC3-GFP were maintained in the presence or absence of 
doxycycline and subsequently analysed by flow cytometry. As shown 
in Fig. 4d, SCs had a dramatically reduced GFP signal (80%) compared 
with KRas-expressing cells, indicating that GFP was quenched upon 
autolysosome formation, as also confirmed by the treatment with bafi- 
lomycin, a vacuolar H*-ATPase inhibitor (Fig. 4d). 

Upon TEM analysis, we observed increased content of cytoplasmic 
lipid droplets in SCs (Extended Data Fig. 10b) that we quantified with 
neutral lipid-specific dyes (Fig. 4e). Importantly, and unique to SCs, we 
observed unambiguous evidence of microlipophagy”’—the fusion of lipid 
droplets with autophagosomes (Fig. 4f)—which we confirmed by dem- 
onstrating strong co-localization of lipid droplets and lysosomes upon 
staining in SCs (Fig. 4g). Consistent with observations that SCs catabo- 
lize more endogenous protein and fatty acid, tracing metabolic experi- 
ments illustrate that more carbon in central metabolism is unaccounted 
for from labelled substrates in SCs versus KRas-expressing cells (Extended 
Data Fig. 7e-h). To determine whether autophagy and microlipophagy 
may representa strategy for SCs to maintain their energetic balance”’”’, 
we explored their contribution in maintaining mitochondrial activity 
by measuring OCR in cells pre-treated with autophagic and B-oxidation 
inhibitors. Neither inhibitor affected OCR in KRas-expressing cells, 
whereas treatment with either bafilomycin or etomoxir both markedly 
reduced OCR in SCs (85% and 35%, respectively) (Fig. 4h). Importantly, 
both inhibitors also induced metabolic stress specifically in SCs (Extended 
Data Fig. 9j, k) and dramatically affected both their spherogenic poten- 
tial (decrease of >90% and 39%, respectively; P = 0.01) (Fig. 4i) and sur- 
vival (Extended Data Fig. 91). We therefore conclude that autophagy and 
microlipophagy are as critical as ETC activity for the survival of SCs. 

Weisolated and characterized tumour cells surviving the genetic and 
pharmacological ablation of oncogenic pathways, demonstrating that 
they have features of TICs and a distinct metabolic profile. Unlike highly 
glycolytic tumour cells, which are dependent on lactate production to 
regenerate NAD* to support continued glycolysis for their anabolic metab- 
olism (Warburg effect), SCs are less glycolytic and more dependent on 
mitochondrial respiration. Their dependence on OXPHOS for energy 
production is consistent with recent reports showing that both normal 
and leukaemic stem cells rely on mitochondrial respiration’. Impor- 
tantly, the strong dependence of ETC activity of SCs on autophagic and 
catabolic processes makes them more resistant to nutrient deprivation 
and environmental stressors. However, their inability to increase compen- 
satory fluxes (namely, glycolysis) following OXPHOS inhibition confers 
extreme sensitivity to inhibition of mitochondrial function, which dra- 
matically impacts their spherogenic and tumorigenic potential. Our find- 
ings hold important implications for tumour treatment, paving the way 
for targeting OXPHOS in association with oncogenic pathway inhibi- 
tors to eradicate SCs and prevent tumour relapse in pancreatic cancer. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Mice. The inducible tetO-LSL-KRas©”, ROSA26-LSL-rtTA, p53" “7, p48-Cre 
mouse model was generated as described’. After weaning, mice were fed doxycy- 
cline (2 g]™ in drinking water supplemented with sucrose (20 g1— 1) and monitored 
for tumour development. Constitutive LSL-KRas@!?, p53'"/'”, Pdx1-Cre and 
LSL-KRas@"””, Ink4a/Arfi°?’*", Pdx1-Cre mice were generated as previously 
reported’. All animals were kept in FVB/C57BI6 mix or pure FVB backgrounds in 
a pathogen-free facility at the University of Texas MD Anderson Cancer Center 
(MDACC). All manipulations were performed under Institutional Animal Care and 
Use Committee (IACUC)-approved protocols. 

Antibodies, plasmids and chemical reagents. Primary antibodies used for flow 
cytometry, immunofluorescence, immunohistochemistry and immunoblotting were: 
CD44 (eBioscience, IM7), CD133 (eBioscience, 13A4; Abcam, 165118), phospho- 
p44/42 (Cell Signaling, D10), VDAC1 (Abcam, ab15895), PGCla (Abcam, ab54481), 
BrdU (Abcam, ab82421), Ras (05-516, Millipore), LC3A/B (Cell Signaling, 4108), 
phospho-AMPK« (Thr 172) (Cell Signaling, 40H9), AMPK (Cell Signaling, D63G4), 
TFAM (Cell Signaling, D5C8), TUEM (Abcam, ab175199), B-actin (sc-1615, Santa 
Cruz). Plasmids: pBABEpuro GFP-LC3 (ref. 26) (Addgene plasmid 22405), pLKO 
TFAM shRNAs (TRCN0000016093, 5'-CCGGCGTGAGTATATTGATCCAGAA 
CTCGAGTTCTGGATCAATATACTCACGTTTTT-3’; TRCN0000016095, 5’-C 
CGGGTAAGTTCTTACCTTCGATTTCTCGAGAAATCGAAGGTAAGAACT 
TACTTTTT-3’) and pLKO TUFM shRNAs (TRCN0000280863, 5’-CCGGGCT 
CACCGAGTTTGGCTATAACTCGAGTTATAGCCAAACTCGGTGAGCTTT 
TTG-3’; TRCN0000280864, 5’-CCGGGAGGACCTGAAGTTCAACCTACTCG 
AGTAGGTTGAACTTCAGGTCCTCTTTTTG-3’) (Sigma), Tet-inducible pLKO 
was provided by the Institute for Applied Cancer Science at MDACC. Chemical 
reagents: oligomycin A (Sigma), buthioninesulphoximine, (Sigma), 4-hydroxynonenal 
(Cayman Chemical), o-tocopherol (Sigma), N-acetylcysteine (Sigma), tetrakis (Cal- 
biochem), bafilomycin (Sigma), etomoxir (Sigma), dicyclohexylcarbodiimide (Sigma), 
venturicidin A (Sigma), rotenone (Seahorse), antimycin (Seahorse), doxycycline 
(Research Product International), AZD8330-AZD6244-BEZ235 (Selleckchem). 
Tumour culture. Patient-derived xenograft tumours were generated by transplant- 
ing subcutaneously in mice small tumour fragments isolated directly from surgical 
specimens according to Kim et al.'’. Patient-derived samples were obtained from 
consented patients under an Institutional Review Board (IRB)-approved protocol 
LABO07-0854 chaired by J.B.F. Cells derived from early passage patient-derived xeno- 
graft tumours (F1-F2) and primary mouse tumours were kept in culture as spheres 
in semi-solid media for <15-16 passages. Briefly, after explant, tumours were digested 
at 37 °C for 1 h (Collagenase IV-Dispase 4 mg ml; Invitrogen). Single-cell suspen- 
sions were plated in stem cell medium (SCM) MEBM (Lonza) supplemented with 
2mM glutamine (Invitrogen), B27 (Invitrogen), 20 ng ml ~ ' hEGE (PeproTech), 
20 ng ml | hEGE (PeproTech), 5 ug ml | h-Insulin (Roche), 0.5 1M hydrocorti- 
sone (Sigma), 100 LM B-mercaptoethanol (Sigma), 4 pgml? heparin (Sigma). Meth- 
ocult M3134 (StemCell Technologies) was added to SCM (final concentration 0.8%) 
to keep tumour cells growing as clonal spheres versus aggregates. Fully formed tumour 
spheres were collected and digested with 0.05% trypsin (Gibco) to single cells and 
re-plated in culture. KRas inducible system: for maintenance, tumour cells were 
kept in doxycycline (2 1g ml” '). For experiments, cells were plated in the presence 
or absence of doxycycline to select SCs. After 8 days in culture, tumour spheres or 
SCs were collected, washed and processed for experiments. 

In vitro treatments. For drug treatments, KRas-expressing spheres or surviving 
tumour cells after 8 days of doxycycline withdrawal were collected, washed, digested 
with trypsin and repeatedly counted (Countless, Invitrogen). Similar procedures 
were used for spheres derived from human tumours and from the KRas constitu- 
tive mouse model except that surviving cells were selected by treating tumour spheres 
with AZD8330 (10 nM) and BEZ235 (100 nM) for 1 week. Equal numbers of live 
KRas-expressing cells and surviving tumour cells (AZD8330 plus BEZ235 treated 
cells for the constitutive KRas and human tumours) were treated with oligomycin 
(200 nM for 24h), venturicidin (500 nM for 24h), DCCD (1 uM for 24h), bafilo- 
mycin (50 nM for 48 h) or etomoxir (100 [1M for 48 h). For OCR measurement and 
western-blot experiments, cells were treated with bafilomycin and etomoxir for 24 
and 6h, respectively. To test the effects of ROS on spherogenic potential, cells were 
treated with buthionine-sulphoximine (100 |1M) to deplete GSH or pre-treated with 
a-tocopherol (100 ,1M), N-acetylcysteine (1 mM) or Mn-tetrakis (50 |1M) before 
oligomycin treatment. After drug wash out, treated cells were re-plated for 7 days 
(on doxycycline when using the KRas-inducible system). The number of tumour 
spheres (spherogenic potential) was quantified using ImageXpress Velos Laser Scan- 
ning Cytometer (Molecular Devices) upon calcein staining (Molecular Probes). 
Tumour transplantation, limiting dilution and in vivo treatments. Tumour 
cells isolated from in vivo tumours or from in vitro spheres were digested to single 
cells (see earlier). Usually, 10*-10° tumour cells were used for routine transplanta- 
tion, for transplantation in limiting dilution 10°, 10° or 10 tumour cells were used. 
Tumour cells were suspended in SCM (see earlier) and Matrigel (BD Biosciences, 


LETTER 


356231) (1:1 dilution) and injected subcutaneously into the flank of 6- to 8-week- 
old female immunodeficient mice (NCR-NU, Taconic). If cells were derived from 
KRas-inducible tumours, mice were injected with doxycycline (3 mgkg™', ip.) at 
the time of transplantation and then fed with doxycycline in drinking water. Tumour- 
initiating cell (TIC) frequencies were determined by Poisson statistical analysis using 
L-Calc software (StemCell Technologies). For BrdU incorporation experiments, 
mice were injected i.p. with 1 mg of BrdU 3 times (every 8h) after 24h or 48h of 
KRas reactivation before being killed. For in vivo oligomycin treatment, mice were 
transplanted with tumour cells and fed with doxycycline in drinking water (+ Dox) 
until tumours reached 1 cm in diameter. At that time, doxycycline was withdrawn 
(—Dox) and, after 2 weeks, when tumours were almost completely regressed, mice 
were injected with oligomycin (0.5 mg kg’, i-p.; Sigma) or vehicle, 5 days a week, 
for 2 weeks. After treatment, doxycycline was added back to drinking water and mice 
were monitored for tumour relapse. Genetically identical and age matched recipi- 
ent mice were used for transplantation experiments and were randomly allocated 
for treatment with oligomycin upon tumour regression. Experimenters were not 
blinded to the experimental groups in evaluating treatment outcome. 

For pharmacological inhibition of KRas pathways using small molecule inhibi- 
tors, mice bearing tumours derived from the KRas constitutive system were treated 
with a combination of AZD6244 and BEZ235 (100 mgkg~ ' and 40 mgkg ' respec- 
tively, per oral gavage daily) for at least 7 days. Tumour volume was calculated using 
the formula: V = P X L/2 (where lis length and L is width). All manipulations were 
performed under IACUC-approved protocols. 

Isolation of tumour cells. To isolate pure populations of tumour cells from trans- 
planted tumours, we took advantage of dim-high ubiquitous expression of CD44 
in pancreatic tumour cells (Extended Data Fig. 2f). Single-cell suspensions of digested 
tumours were stained with anti-CD44 biotinylated antibody (eBioscience, IM7). 
Positive cells were then purified using Easy Sep Biotin Selection Kit (StemCell Tech- 
nologies) according to manufacturer’s instructions. Counterstaining of isolated 
cells with anti-CD45 and CD31 antibodies and fluorochrome-conjugated strepta- 
vidin were used to check the purity by flowcytometry. 

shRNA expression and gene downregulation. shRNAs against TFAM (TRCN00 
00016093; TRCN0000016095) and TUFM (TRCN0000280863; TRCN0000280864) 
(Sigma) were cloned in a Tet-inducible pLKO vector provided by the Institute for 
Applied Cancer Science at MDACC (for sequences see also earlier). Human tumour 
spheres were transduced with viral particles and selected with puromycin. Upon 
selection, tumour spheres were treated or not with a combination of Mek/Pi3K 
inhibitors (10 nM AZD83330 plus 100 nM BEZ235) for 1 week. On the second day 
after beginning the combination drug treatment, doxycycline was added to the cul- 
ture to induce shRNA expression and was maintained for 5 days. Then tumour cells 
were collected, washed, and replated to evaluate their spherogenic potential. Down- 
regulation of the target was evaluated by western blot at 72 h after shRNA induction. 
Flow cytometry and cell sorting. Single cells isolated from tumours or spheres 
were stained with primary antibodies after blocking with 10% BSA and 5% rat serum. 
Aldefluor (StemCell Technologies) and annexin V (eBioscience) staining were per- 
formed according to the manufacturer’s instructions. To study the cell cycle of tumour 
spheres, BrdU Flow Kit (BD Pharmingen) was used according to datasheet speci- 
fications. Mitochondrial potential in vitro was measured using MitoProbe DilC,(5) 
Assay Kit for Flow Cytometry (Molecular Probes) according to specifications, and 
CCCP treatment was used for controls. In vivo mitochondrial potential was evalu- 
ated according to Zheng et al.”’. Briefly, mice bearing KRas-expressing or regressed 
tumours (inducible model or pharmacologically treated with AZD6244 plus BEZ235) 
were injected with 25 nmol kg _' of TMRE (Molecular Probes) as a tail vein bolus. 
After 1h, mice were killed and tumours explanted, digested (as described earlier, 
adding 10% FBS to the digestion mix) and stained for CD44. Samples were kept on 
ice and immediately acquired gating CD44-positive DAPI-negative cells. The same 
approach, timeline and staining were used to evaluate glucose uptake in vivo, inject- 
ing 25 pmol kg * 2NBDG (Molecular Probes) asa tail vein bolus according to Ytoh 
et al.”*. For in vitro experiments, 2NBDG was used at a concentration of 10 1M in 
complete stem cell medium containing 2 mM glucose”. Cells were incubated for 
6h then washed and analysed by flow cytometry. MitoTracker Green and Deep Red 
(Molecular Probes) were used to measure mitochondrial mass. ROS were evaluated 
using MitoSOX and CellROX (Molecular Probes) and induced by 4-hydroxynonenal 
treatment (10 1M) for positive controls. LipidTox deep red (Molecular Probes) was 
used for quantifying lipid droplet content. All staining procedures were performed 
according to the manufacturer’s protocols. DAPI (Invitrogen) was used to stain DNA 
content or to exclude dead cells depending on the experiment. For measuring auto- 
phagic flux, KRas-expressing cells were transduced with pBABEPuro GFP-LC3 (ref. 26) 
and, upon selection, doxycycline was withdrawn for 8 days. Mean of fluorescence 
of surviving and matched KRas-expressing cells was quantified and surviving tumour 
cells treated for 24h with bafilomycin (50 nM) were used as a control. Gating strat- 
egies to exclude doublets and dead cells (DAPI) were always employed. After staining, 
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samples were acquired using a BD FACSCantolI flow cytometer or sorted using 
BD Influx cell sorter. Data were analysed by BD FACSDiva or FlowJo (Tree Star). 
Immunohistochemistry and immunofluorescence. Tumour samples were fixed 
in 4% formaldehyde for 2 to 4h on ice, moved in 70% ethanol for 12h, and then 
embedded in paraffin (Leica ASP300S). After cutting (Leica RM2235), baking and 
deparaffinization, slides were treated with Citra-Plus Solution (BioGenex) accord- 
ing to specifications. For immunohistochemistry (IHC) staining, endogenous per- 
oxidases were inactivated by 3% hydrogen peroxide. Non-specific signals were blocked 
using 3% BSA, 10% goat serum and 0.1% Triton. Tumour samples were stained 
with primary antibodies. For BrdU detection, samples were digested on slides for 
1h at 37 °C with DNase I (300 jg ml’) before staining. For IHC, ImmPress and 
ImmPress-AP (Vector Lab) were used as secondary antibodies and Nova RED, Vector 
BLUE and DAB were used for detection (Vector Lab). Images were captured with a 
Nikon DS-Fil digital camera using a wide-field Nikon EclipseCi microscope. For 
immunofluorescence, secondary antibodies conjugated with Alexa488 and 555 (Mole- 
cular Probes) were used. Images were captured with a Hamamatsu C11440 digital 
camera, using a wide-field Nikon EclipseNi microscope. LipidTox, Lysotracker, 
MitoTracker, CellRox and Hoechst 33342 (Molecular Probes) were used on live 
spheres and cells at the concentrations suggested by manufacturer’s protocols and 
images were acquired using a Nikon high-speed multiphoton confocal microscope 
A1RMP. 

Immunoblotting. Protein lysates were resolved on 5-15% gradient polyacrylamide 
SDS gels and transferred onto PVDF membranes according to standard proce- 
dures. Membranes were incubated with indicated primary antibodies, washed, and 
probed with HRP-conjugated secondary antibodies. The detection of bands was 
carried out upon chemiluminescence reaction followed by film exposure. For the 
Ras pull-down assay, the amount of active Ras was evaluated by detecting the frac- 
tion of Ras protein that co-precipitated with RAF kinase. Cell lysates from KRas- 
expressing cells and cells surviving KRas ablation were co-incubated with RAF-linked 
agarose beads for 2h. After incubation, beads were collected, washed and boiled 
for 5 min in the presence of laemmli loading buffer supplemented with 10% 2- 
mercapthoethanol and ultimately loaded onto SDS-PAGE gels. The detection of 
the active fraction of RAS was carried out using standard western blot procedures 
with anti-Ras antibody. 

Transmission electron microscopy. TEM was performed at the MDACC High 
Resolution Electron Microscopy Facility. Samples were fixed with a solution contain- 
ing 3% glutaraldehyde plus 2% paraformaldehyde in 0.1 M cacodylate buffer, pH 7.3, 
for 1h. After fixation, the samples were washed and treated with 0.1% Millipore- 
filtered cacodylate buffered tannic acid, post-fixed with 1% buffered osmium tetrox- 
ide for 30 min, and stained en bloc with 1% Millipore-filtered uranyl acetate. The 
samples were dehydrated in increasing concentrations of ethanol, infiltrated and 
embedded in LX-112 medium. The samples were polymerized in a 60 °C oven for 
2 days. Ultrathin sections were cut in a Leica Ultracut microtome, stained with uranyl 
acetate and lead citrate in a Leica EM Stainer and examined in a JEM 1010 trans- 
mission electron microscope (JEOL) at an accelerating voltage of 80kV. Digital 
images were obtained using an AMT Imaging System (Advanced Microscopy Tech- 
niques Corp). 

DNA, RNA, cDNA and qPCR. DNA and RNA were extracted using DNeasy Blood 
and Tissue Kit (Qiagen) and RNeasy Mini Kit (Qiagen) according to technical 
specifications. A mix of random hexamers and oligo(dT) were applied for cDNA 
synthesis using SuperScript III First-Strand-Synthesis System (Invitrogen). For qPCR, 
10 ng of DNA or cDNA was amplified with EXPRESS SYBR GreenER qPCR Super- 
Mix (Invitrogen) using a Startagene Mx3005p thermal-cycler. Primers used for mito- 
chondrial and lipid metabolic gene amplification are shown in Extended Data 
Fig. 10c. B-Actin (F-GACGGCCAGGTCATCACTAATTG, R-AGGAAGGCTG 
GAAAAGAGCC), 288 (F-TCATCAGACCCCAGAAAAGG, R-GATTCGGCA 
GGTGAGTTGTT) and B-microglobulin (F-ATTCACCCCACTGAGACTG, R- 
TGCTATTTCTTTCTGCGTGC) were used as house-keeping genes for normali- 
zation. Expression of genes involved in the ETC, mitochondria and autophagy was 
evaluated using Qiagen commercial arrays: RT2 Profiler PCR Array Mouse Mito- 
chondrial Energy Metabolism, Mouse Mitochondria and Mouse Autophagy. 
Expression profiling and data analysis. Gene expression profiling was performed 
at the Dana-Farber Cancer Institute Microarray Core Facility. RNA isolated from 
KRas-expressing and surviving tumour cells was hybridized on a Gene Chip Mouse 
Genome 430 2.0 Array (Affymetrix). Raw data (CEL files) were pre-processed using 
a robust multi-array analysis (RMA) and analysed with GSEA” using Signalling 
Pathways c2.cp.v3.0 and TFT c3.tft.v3.0 symbols gene sets. 

Exome sequencing. Two independent tumour spheres expressing KRas were col- 
lected, digested to single cells, and replated on two plates in the presence of doxy- 
cycline (ON Dox, +KRas) for 1 week. Spheres from one plate were collected and 
snap frozen for DNA extraction (Reference), while spheres from the other plate were 
plated OFF doxyclycline (—KRas) for 1 week before re-addition of doxycycline to 
re-express KRas. After 1 week, reformed spheres were collected and snap frozen for 


DNA extraction (Re-ON). Genomic DNA was extracted by phenol-chloroform and 
further purified from contamination on columns (Qiagen). The exonic DNA regions 
were captured using Nimblegen SeqCap EZ Mouse Exome kit. The DNA sequences 
recovered were processed through a standard SNP calling pipeline. Reads were 
aligned using BWA following removal of duplicates, realignment and recalibration 
(using Broad’s Genome Analysis Toolkit or GATK). Further SNPs were called using 
GATK’s Unified Genotyper and annotated using Annovar. For the analysis we con- 
sidered SNVs which were (1) called in both samples, and (2) had a minimum coy- 
erage of 200 in both, providing enough confidence to compare allelic frequencies 
between the sample pairs. Roughly 12% (40,383/335,076) and 13% (44,182/339,398) 
of the common SNVs for tumour #2 and tumour #1, respectively, satisfied the 
coverage threshold of 200. 

Metabolomics. For metabolomic analyses, surviving and KRas-expressing spheres 
(grown as described earlier), were collected after 8 days of culture when both KRas- 
expressing spheres and surviving tumour cells were confirmed to have exited the cell 
cycle (to avoid the confounding effects of proliferation on metabolism) (Extended 
Data Fig. 1g). The day before collection, medium was changed and spheres re-plated 
in fresh medium. After 24h, cells were collected by centrifugation, washed three 
times and samples were then immediately lysed in methanol:water (80:20) at dry- 
ice temperature. The quantity of the metabolite fraction analysed was adjusted to 
the corresponding protein concentration calculated upon processing a parallel sam- 
ple. Metabolite fractions were processed and analysed by targeted LC-MS/MS via 
selected reaction monitoring (SRM), as described’!*"*, Processed data were ana- 
lysed in Cluster 3.0 and TreeViewer. The analysis was performed on four indepen- 
dent tumours in biological triplicate. 

Carbon-13 metabolic tracing. Cells were prepared in a manner identical to that 
for routine metabolomic profiling (see earlier), except that media formulations were 
adjusted to account for isotopically labelled substrates. Specifically, surviving and 
KRas-expressing cells were plated in RPMI medium (reconstituted with growth 
factors) devoid of glucose, glutamine, pyruvate, palmitate and supplemented with 
one of the four carbon-13-labelled substrates where the remaining substrates were 
unlabelled (glucose 10 mM, glutamine 2 mM, pyruvate 1 mM and palmitate 75 1M). 
After 24h of incubation with labelled substrates, cells were collected and replated 
again in fresh media supplemented with carbon-13-labelled substrates for another 
12h to minimize the effects of cellular uptake on the concentration of different 
substrates. After 36 total hours of labelling, samples were collected by centrifuga- 
tion and immediately lysed in methanol:water (80:20) at dry-ice temperature. As 
for metabolomics, the quantity of the metabolite fraction analysed was adjusted 
to the corresponding protein concentration from a sample processed in parallel. 
The analysis was performed for each of the four substrates on three independent 
tumours in biological triplicate. Data were collected and processed as above, as 
described’!?"*. 

Oxygen consumption and glycolytic capacity. Tumour spheres and surviving cells 
were digested to single cells and spun into XF96 Cell Culture Microplates (Seahorse 
Bioscience) previously treated with Cell-Tak (BD Biosciences) immediately before 
the experiment. To measure OCR and the response to OXPHOS inhibition, we 
plated cells in complete stem cell medium (see earlier) prepared with MEBM lacking 
NaHCO; (Lonza) and supplemented with 5 mM pyruvate (Sigma). Oligomycin, 
FCCP, antimycin and rotenone (XF Cell Mito Stress Test Kit, Seahorse Bioscience) 
were sequentially injected at a final concentration of 1 1M. To measure ECAR, we 
plated tumour cells in base assay medium (D5030, Sigma) adjusted to pH 7.4. Glu- 
cose (10 mM final concentration, Sigma), oligomycin (1 [1M final concentration) 
and 2-deoxy-glucose (100 mM final concentration) were sequentially injected (XF 
Glycolysis Stress Test Kit, Seahorse Bioscience). Specifically for the experiment in 
Extended Data Fig. 5c, ECAR was measured in complete stem cell media to eval- 
uate the glycolytic reserve of tumour cells in a nutrient-rich environment. Oligo- 
mycin and 2DG were sequentially injected at a final concentration of 1 1M and 
100 mM, respectively. Experiments were run using a XF96 analyser and raw data 
were normalized to metabolically active cells, evaluated as Hoechst 33342 positive/ 
propidium iodide negative, by an Operetta High-Content Imaging System (Perkin- 
Elmer) immediately after each experiment. 

Glucose uptake and lactate production. To measure glucose uptake, we used two 
different approaches: by flow cytometry using the fluorescent glucose analogue 
2NBDG (see earlier) and by YSI 2950 Biochemistry Analyzer (YSI Life Sciences). 
For YSI, the same number of surviving cells and KRas-expressing cells was plated 
in triplicate in a 96-well plate in 200 ll of complete stem cell medium. After 24h, 
the medium was collected and glucose and lactate concentrations were evaluated 
by YSI analyser, using unconditioned fresh medium asa reference. Lactate produc- 
tion in response to oligomycin treatment (200 nM) was independently measured 
by Lactate Kit (Trinity Biotech) to directly evaluate the concentration of lactic acid 
in stem cell medium 6 to 12h after tumour cell plating. 

ATP and glutathione assays. ATP production of tumour cells in response to 6 to 
12h of oligomycin treatment (200 nM) was measured using Cell Titer Glo (Promega) 


©2014 Macmillan Publishers Limited. All rights reserved 


following the manufacturer’s instructions. For glutathione, cells were lysed by sonica- 
tion in 1X GSH MES buffer (Cayman Chemical) and debris was pelleted by high-speed 
centrifugation. Samples were deproteinized by adding vol:vol of meta-phosphoric 
acid (1 mg ml‘) followed by centrifugation and pH equilibration by the addition 
of triethanolamine. Glutathione content was evaluated through an enzymatic recy- 
cling method by using a commercially available kit (Cayman Chemical). PHERA- 
Star plus microplate reader (BMG Labtech) was used to measure luminescence and 
absorbance. 

Statistical analysis. In vitro and in vivo data are presented as the mean + s.d. Statis- 
tical analyses were performed using a two-tailed Student’s t-test after the evaluation 
of variance. Results from survival experiments were analysed with a Gehan-Breslow- 
Wilcoxon test and expressed as Kaplan-Meier survival curves. 
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Extended Data Figure 1 | Oncogene ablation leads to tumour regression 
in vitro and in vivo. a, In vivo experimental scheme. Tumour cells isolated 
from primary tumours or tumour spheres were injected in nude mice fed with 
doxycycline in drinking water (+ Dox). When mice developed tumours, 
doxycycline was withdrawn (—Dox) and tumours underwent a complete 
macroscopic regression after 2-3 weeks (arrows indicate regressed tumours). In 
residual lesions few tumour cells remain quiescent for months and they can 
quickly reform tumours upon KRas reactivation (+ Dox). b, Tumours 
expressing KRas (+KRas) and tumour remnants after regression (— KRas) are 
positive for ductal epithelial marker CK19 (X40). c, Tumours expressing KRas 
(+KRas) and epithelial remnants after tumour regression (—KRas) were 
stained for phosphorylated-p42/44 (pErk). No signal is detected in surviving 
cells (X20). d, In vitro experimental scheme. After digestion to a single-cell 
suspension, tumour cells isolated from primary tumours were plated in stem 
cell medium in presence of doxycycline (+ Dox, +KRas). Spherogenic cells 
form tumour spheres (+ KRas) that can be maintained by serial replating in 
presence of doxycycline. Upon doxycycline withdrawal (—Dox) tumour 
spheres undergo involution and only a minority of cells survive the ablation of 
KRas (SCs, —KRas). SCs readily reform tumour spheres upon re-activation of 
KRas (+ Dox). e, The amount of active Ras in KRas-expressing cells (+ KRas) 
and SCs (—KRas) was evaluated in three independent tumour spheres by 
detecting the fraction of Ras protein that co-precipitates with Raf kinase. Total 
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lysates were probed with anti-phospho-p42/44 (pErk), total p42/44 (Erk) 
antibodies. f, Annexin V staining in tumour spheres after 3 days +/—KRas 
(n = 3). g, Sphere formation is a regulated process and tumour cells enter 
and exit cell cycle. BrdU incorporation (pulse of 3 h) was evaluated at different 
time points during sphere formation and regression. KRas-expressing fully 
formed spheres (day (D)0 and 8) are quiescent. Upon sphere dissociation 
and replating (D0), spherogenic cells enter cell cycle (D1) and tumour cells 
continue to grow until day 3-4, when spheres reach their maximal S phase. 
Then tumour cells gradually exit the cell cycle and become quiescent (D8). 
After doxycycline withdrawal (—KRas), tumour spheres undergo involution 
and surviving cells remain quiescent until KRas is re-expressed (—KRas 24h 
+KRas) and spheres are reformed. Ruling out the effect of the cell cycle, 
transcriptomic and metabolomic characterizations were done, matching 
quiescent surviving tumour cells to quiescent fully formed KRas-expressing 
spheres at D8 (n = 3). h, Haematoxylin and eosin staining and 
immunohistochemistry of regressed tumours after three 8h pulses of BrdU 
show that epithelial remnants in regressed tumours after KRas ablation 
(—KRas) are completely quiescent (left panels). Forty-eight hours after KRas 
reactivation (doxycycline i.p. injection) tumour cells re-enter massively the cell 
cycle (right panels). Red arrows indicate mitotic cells (X20). i, Representative 
annexin V staining with respect to CD133 and CD44 after 3 days of KRas 
ablation, two independent tumours are represented. Data are mean + s.d. 
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Extended Data Figure 2 | Transplantation in limiting dilution and 
characterization of epithelial remnants. a-d, Transplantation in limiting 
dilution. Experiments, number of transplanted mice and percentage of 
developed tumour are shown. a, Limiting dilution experiments using tumour 
spheres (+KRas) and surviving cells (—KRas) (genetic model ex vitro). 

b, Limiting dilution experiments using cells isolated from KRas-expressing 
(+KRas) and regressed tumours (— KRas) (genetic model ex vivo). c, Top 
panels, immunoblots of tumour spheres treated with different concentrations 
of Mek1 (AZD8330) and dual PI3K/mTOR (BEZ235) inhibitors probed with 
anti-phospho-p42/44 (pErk), phospho-Akt (pAkt), pan-Ras (Ras) and B-actin 
(Act) antibodies. Bottom panels, effects of AZD8330 (AZD 0.01 uM) and 
BEZ235 (BEZ 0.1 1M) treatment for 1 week on tumour sphere formation; some 
cells, as single cells or in small clusters, are able to survive the treatment (5). 
d, Limiting dilution experiments using cells surviving pharmacological 
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downregulation of oncogenic pathways (AZD+BEZ, combination of 
AZD8330 and BEZ235) and control cells (CTRL). e, The plot shows the 
cumulative distribution of coverage at all the SNVs called by Unified Genotyper 
(across samples). f, CD44 is expressed during tumorigenesis in mice: no 
positive cells are detected in normal pancreas (left panel), KRas-expressing 
tumours express high levels of CD44 (middle panel), epithelial remnants in 
regressed tumours maintain their positivity for CD44 (right panel) (10). 

g, Validation of CD133 (ab16518) in immunohistochemistry: this antibody 
does not recognize cells and ductal structures in normal pancreas (left panel), a 
small population of cells is stained by ab16518 in KRas-expressing tumours 
(middle panel), epithelial remnants in regressed tumours are strongly positive 
for CD133 (right panel) (x10). At higher magnification (red boxes), it is 
possible to appreciate the classical polarized pattern of CD133. 
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c, d, Genes encoding proteins of the autophagic molecular machinery and its 
key regulators ( = 5) (c) and B-oxidation (n = 5) (d). e, mRNA fold change 
in cells surviving AZD8330 plus BEZ235 treatment (AZD+BEZ) versus 


controls (n = 3). Data are mean + s.d. 


Extended Data Figure 3 | qPCR validation of pathways enriched in 


5). 


Ils. a-e, mRNA fold change in surviving cells normalized to 


KRas-expressing cells. a, Genes involved in ETC (n = 5). b, Genes involved in 


surviving ce 


the biogenesis and function of mitochondria (Mitochondria) (n 
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Extended Data Figure 4 | Surviving cells have more active mitochondria. 
a, mRNA fold change of Ppargc1 genes in —KRas versus +KRas cells (n = 5). 
b, Quantification of MitoTracker Green staining in +KRas and —KRas cells 
(n = 3). ¢, Mitochondrial membrane potential (AW) of +KRas and —KRas 
cells (n = 4); representative flow-cytometry analysis of two tumours. 

d, Immunoblot of two independent tumour spheres derived from different 
genetic backgrounds (Ink4a/Arf ‘~ and p53‘ ) treated or not with AZD8330 
and BEZ235 (AZD+BEZ) for 7 days and probed with anti-VDAC1 (VDAC1) 
and f-actin (Actin) antibodies. e, Cells surviving AZD+BEZ treatment have 
higher mitochondrial transmembrane potential (Ay,,,) than untreated cells 
(CTRL) (n = 3); representative flow-cytometry analysis is reported. f, Mice 
bearing tumours have been treated (AZD+BEZ) or not (CTRL) with a 
combination of AZD6244 and BEZ235 for 1 week. Upon tail vein injection of a 
bolus of TMRE, tumours were explanted and analysed by flow cytometry for 


their mitochondrial potential (Aw,,) upon gating on CD44* DAPI” cells 

(n = 3). A representatie flow-cytometry analysis of two different tumours is 
reported, AB+CCCP samples represent reacquisition of AZD+BEZ samples 
after incubation with CCCP for 5 min. g, ROS production in +KRas and 
—KRas cells (n = 3); representative flow-cytometry analysis of two tumours. 
h, Live confocal imaging of SCs stained for mitochondria (MitoTracker Green), 
ROS (CellRox-Red) and DNA (Hoechst). The vast majority of signal generated 
by ROS colocalizes with mitochondria. i, Immunoblot of Aldefluor/CD133 
double-positive and double-negative cells sorted from two independent 
tumours probed with anti- VDAC1 (VDAC1) and B-actin (Actin) antibodies. 
j, KRas-expressing cells positive for aldefluor (Ald*) and CD133 (CD133°) 
have higher mitochondrial transmembrane potential (Ay,,,) than tumour cells 
that do not express the same markers (Ald and CD133_) (n = 3); a 
representative flow-cytometry analysis is reported. Data are mean + s.d. 
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Extended Data Figure 5 | OCR, ECAR and metabolomics. a, OCR of KRas- 
expressing cells (+KRas) and SCs (— KRas) in response to oligomycin, FCCP 
and rotenone/antimycin (n = 4). b, Same as in a but normalized to basal 
respiration of +KRas and — Kras cells. c, ECAR response of +KRas and —KRas 
cells to oligomycin and 2DG. The experiment has been carried out in complete 
stem cell media to evaluate the glycolytic reserve of tumour cells in a nutrient 
rich environment. d, Metabolome analysis for +/—KRas cells; unsupervised 
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hierarchical clustering and heat map of significantly (P< 0.05) deregulated 
metabolites (n = 4). e, Lactate production of +KRas and —KRas cells in 
response to oligomycin (Oligo) or DMSO (Ctrl) treatment (n = 3). f, Fold 
change of TCA cycle intermediates in +KRas versus —KRas cells («KG, 
a-ketoglutarate) (n = 4). g, Fold change of nucleotide triphosphates and 
deoxynucleotide triphosphates, glutathione (GSH) and glutathione disulphide 
(GSSG) in —KRas versus + KRas cells (n = 4). Data are mean + s.d. 
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Extended Data Figure 8 | Differential sensitivity of tumour cells to 
OXPHOS inhibition. a, Annexin V staining of cells treated with oligomycin 
200 nM (Oli) for 24h shows a significant decrease in viability in surviving cells 
(—KRas). By contrast, control cells expressing KRas (+KRas) are minimally 
affected (n = 3); a representative flow-cytometry analysis is reported. b, Effect 
of oligomycin (Oli), dicyclohexylcarbodiimide (DCCD), veturicidin (Vent), 
rotenone (Rot), antimycin (Anti) and DMSO (Ctrl) on spherogenic potential of 
KRas-expressing (+KRas) and SCs (—KRas) (n = 4). ¢, In vivo treatment 
experimental scheme: mice were transplanted with tumour cells and fed with 
doxycycline in drinking water (+KRas, + Dox) until they developed tumours of 
1 cm in diameter. Then doxycycline was withdrawn (—KRas, — Dox) and after 
2 weeks, when tumours were regressed, mice were treated with oligomycin 
(0.5 mg kg, ip.) or vehicle for 5 days a week, for 2 weeks. After treatment, 
KRas was re-induced (+Dox) and mice were monitored for tumour relapse. 
d, One dose of oligomycin (0.5 mgkg ', i.p.) is sufficient to increase lactate 
concentration in plasma of treated mice after 4h from injection. Oligo, 
oligomycin; Ctrl, vehicle. n = 4. e, Tumour volume of KRas-expressing 
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tumours treated with either vehicle or oligomycin 0.5mgkg ', 5 days a week, 
for 2 weeks. Treatment was started when tumours reached 5 mm of diameter (5 
mice per group). f, SCs after treatment with oligomycin show signs of 
degeneration and epithelial remnants change their morphology. Red arrows 
indicate the presence of capillaries (red blood cells) indicating that regressed 
tumours are vascularized (40). g, Oligomycin (Oli) induces ROS production 
in KRas-expressing cells (+ KRas) and SCs (—KRas). Its effect is even stronger 
than that of positive control 4-hydroxynonenal (hne) ( = 3). h, Glutathione 
levels in KRas-expressing cells (+ KRas) and SCs (—KRas) before and after 
buthionine sulphoximine (BSO) treatment. Glutathione is increased in SCs and 
BSO treatment is effective in reducing its level (n = 3). i, Effect of glutathione 
depletion on spherogenic potential of KRas-expressing cells (+ KRas) and SCs 
(—KRas) (n = 3). j, ROS production in SCs after treatment with 
4-hydroxynonenal (hne) and oligomycin (oli) in the presence or absence of 
antioxidants: o-tocopherol (vitE), N-acetylcysteine (nac) and tetrakis (Tet) 

(n = 2). k, Effect of oligomycin on spherogenic potential of surviving cells pre- 
treated with antioxidants (n = 4). Data are mean = s.d. 
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Extended Data Figure 9 | Effect of mitochondrial downregulation in 
human tumour spheres and metabolic stress mediated by inhibition of 
autophagy. a, Effects of the combination of AZD8330 and BEZ235 
(AZD+BEZ) on human tumour spheres. Some cells, usually doublets, are able 
to survive the treatment (<5). b, Immunoblots of human tumour spheres 
treated or not with AZD+ BEZ probed with anti-phospho-p42/44 (pErk), total- 
Erk (Erk), phospho-Akt (pAkt), Akt and B-actin (Actin) antibodies; two 
independent tumours are reported. c, Annexin V staining of treated 
(AZD+BEZ) and control (Ctrl) cells after 4 days of treatment (n = 3). 

d, Mitochondrial transmembrane potential (Aiy,,) of untreated (Ctrl) and 
treated (AZD+BEZ) human spheres with AZD8330 and BEZ235 for 7 days 
(n = 3); representative flow-cytometry analysis of two tumours. e-~h, TFAM 
and TUFM were downregulated using two inducible short hairpin RNAs 
(shRNAs) each (TFAM: #93, #95; TUFM: #63, #64) in human spheres 
expressing KRas (untreated) and cells surviving 1 week of treatment with 
AZD8330 and BEZ235 (AZD+BEZ); after 5 days of shRNA induction cells 
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were replated for evaluating their spherogenic capacity. e, Immunoblots of 
tumour spheres after 72h of shRNA induction (+ Dox) probed with anti- 
TFAM, TUEFM and HSP90 antibodies. f, Representative calcein staining after 
sphere replating. g, h, Effects of downregulation of TEAM and TUFM on 
spherogenic potential of untreated (g) and treated (h) cells; data represent the 
average of two independent human tumours. i, Immunoblot of KRas- 
expressing cells treated or not with oligomycin 200 nM (Oligo, +/—) probed 
with anti-Thr-172-phospho-AMPK and actin antibodies. j, k, immunoblots of 
+KRas and —KRas cells treated with etomoxir (Eto, 100 1M for 6h) (j) and 
bafilomycin (Baf, 50 nM for 24h) (k) probed with anti-Thr-172-phospho- 
AMPK and vinculin antibodies. 1, Annexin V staining of cells treated for 48h 
with bafilomycin 50 nM (Baf) and etomoxir 100 1M (Eto) clearly shows a 
significant decrease in viability of SCs (—KRas). Control cells expressing KRas 
(+KRas) are not affected (n = 3); representative dot-plots are reported. Data 
are mean + s.d. 
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Gene Symbol Accession Forward Reverse Product Length 
lipoprotein lipase Lpl NML008509,2, CCAGGGGGTCACCTGGTCGAA = CCAGCTGGTCCACGTCTCCGA 123 
FAT/CD36 CD36 NML00159555.1 9 ACCGGGCCACGTAGAAAACACT ACACAGCCAGGACTGCACCA 150 
solute carrier family 27 Slc27al NYL011977.3 GGCCTCACTGGCGCTGCTTT GGTCTCGCCTCGCCGTCTTG 131 
caveolin 1 Cav NM_007e16,4 GCATGATGCCGGCAGCTTTCCTA }=CACGCTGGCCTCATTCAGTTGC 264 
carnitine palmitoyltransferase La Cptla NM013495.2 CAGCACCTGTACCGCCTCGC AGCCGTCATCAGCAACCGGC 238 
carnitine palmitoyltransferase 1b Cptlb NM009948,2, GGGCAGACGCGCCCATCATT GCTCGGGAATGTCCCACGGC 150 
carnitine palmitoyltransferase le Cptle NM_001252470.1 =ATGGGCCGACTGCCCTGTCT CCCCTCAGGGCCAGCGAGAT 186 
carnitine palmitoyltransferase 2 Cpt2 NML009949.2, GCCCTGTGCCCGAGTTTCCC AACCAGCGGTTCGTGCCGTC 214 
acyl-Coenzyme A dehydrogenase, very long chain Acadvl NAML017366.2, TTTGCCACCGTGGTGGAGGC CTCATGCTGTGCCGTCGGGT 171 
acyl-Coenzyme A dehydrogenase, long-chain Acad! NML007381,3, CCCAGATGACGGCGGGCAAG GCCATGGGCAGGCGATCGAG 212 
acyl-Coenzyme A dehydrogenase, medium chain Acadm NML007382.4 GGGAGGTTGACTCCGGTCGC ATCTGCACGGCGTCAGTGGC 100 
acyl-Coenzyme A dehydrogenase, short chain Acads NM_007383.2 TCCTGGGCGGCATGGGGTAT AGTCCCCGAGGAGGGGCTCA 160 
acyl-Coenzyme A dehydrogenase, short/branched chain Acadsb NML025826.4 AGCCGCGGGAGTCAGGATGA TGGGAGCTGGGGTGCAGACAT 157 
hydroxyacyl-Co A dehydrogenase/3-ketoacyl-Co A thiolase/enoy|-CoA hydratase alpha Hadha NM_178878.2 AGGCCAAGCCCAGGCGTCTA TGCCACCACTAGGCTGCTCG 164 
hydroxyacyl-Co A dehydrogenase/3-ketoacyl-CoA thiolase/enoyl-CoA hydratase beta Hadhb NM 1455581 CTGGCTGTCGGCTGGTCATGG AGTCGGTCGCCTCCTTCTAGAGC 153 
acetyl-Coenzyme A acyltransferase 2 (mitochondrial 3-oxoacyl-Coenzyme A thiolase) Acaa2 NM_177470,3 CGCTCGACCCCAAACCACCC GCCCCAGCACCGTCAGACAC Wi 
acyl-Coenzyme A oxidase 1, palmitoyl Acox1 NM_015729,.2 CAAGTGAGGCGCACTGCCAC GCGTTGGGGCGAGTCACTGT 232 
acyl-Coenzyme A oxidase 2, branched chain Acox2 NM_001161667.1 GITGCCCAGCCCAGACAGCAG AGCAGTGAGCCTIAGCAGCCTGA 191 
acyl-Coenzyme A oxidase 3, pristanoyl Acox3 NNYL030721.2 GCAACCTGGGCAAGACGGCA TCACCGACCATCACCCCCGT 145 
hydroxysteroid (17-beta) dehydrogenase 4 Hsd17b4 NML008292.4 TAGGCCGCCGCCTCAAGAGT CAGCAGAGCCCTTTGCGGGG 162 
enoyl-Coenzyme A, hydratase/3-hydroxyacyl Coenzyme A dehydrogenase Ehhadh NML023737,3 TGTACCTTCGGGGGTCGGGG CGATGCCTCGGCCCATCGTT 168 
acetyl-Coenzyme A acyltransferase 1A Acaala NM 150864.5 ATGCAGTGGTCGGGGTCCCT CTTCTCCACACAGTAGACGGCCT 156 
sterol carrier protein 2. liver Sep2 NM_0113274 GGCCTCGGAGGAGCTGTGGT GCTTCTTTGCCTCCAGGGCCATC 239 
catalase Cat NML009804.2, TTGCCGGCCACCTGAAGGAC GAGAGCCGGCCTGCGTGTAG 165 
superoxide dismutase 2, mitochondrial Sod2 NML013671,3, TGACGTGTGGGAGCACGCTT CAGCCCGGAGCCTGGCACTC 165 
acetyl-Coenzyme A acetyltransferase 1 Acatl NML144784.3, TGCCTCTCAGCCGGCCACTA ACCTGCGCCCAGTGTTGCTT 160 
Peroxisome Proliferator-Activated Receptor Gamma, Coactivator | Alpha ppargela NM_008904 AGGGCGACCCCTTCAAATGGC ACCGGAGGCTGACAACAGGC 173 
Peroxisome Proliferator-Activated Receptor Gamma, Coactivator | Beta ppargelb  NM_133249 TGCCCCTACCAGAGCCCACC CATGCCGGACGCTTGGGGTT 165 


Extended Data Figure 10 | Cells surviving oncogene ablation are engorged 
with autophagosomes and lysosomes and contain lipid droplets. a, SCs 
(—KRas) have a cytoplasm full of phagosomes and autophagosomes, a feature 
absent in KRas-expressing cells (+ KRas) (TEM; X7,500). b, SCs are 


characterized by the presence of several lipid droplets (arrowheads) in the 
cytoplasm (TEM; 7,500). c, Primers used for amplification of mitochondrial 
and lipid metabolic genes. 
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Conditional tolerance of temperate phages via 
transcription-dependent CRISPR-Cas targeting 


Gregory W. Goldberg’, Wenyan Jiang’, David Bikard'+ & Luciano A. Marraffini! 


A fundamental feature of immune systems is the ability to distinguish 
pathogenic from self and commensal elements, and to attack the for- 
mer but tolerate the latter’. Prokaryotic CRISPR-Cas immune systems 
defend against phage infection by using Cas nucleases and small RNA 
guides that specify one or more target sites for cleavage of the viral 
genome”’. Temperate phages include viruses that can integrate into 
the bacterial chromosome, and they can carry genes that provide a 
fitness advantage to the lysogenic host**. However, CRISPR-Cas tar- 
geting that relies strictly on DNA sequence recognition provides indis- 
criminate immunity both to lytic and lysogenic infection by temperate 
phages°—compromising the genetic stability of these potentially ben- 
eficial elements altogether. Here we show that the Staphylococcus 
epidermidis CRISPR-Cas system can prevent lytic infection but tol- 
erate lysogenization by temperate phages. Conditional tolerance is 
achieved through transcription-dependent DNA targeting, and ensures 
that targeting is resumed upon induction of the prophage lytic cycle. 
Our results provide evidence for the functional divergence of CRISPR- 
Cas systems and highlight the importance of targeting mechanism 
diversity. In addition, they extend the concept of ‘tolerance to non- 
self to the prokaryotic branch of adaptive immunity. 

Clustered, regularly interspaced, short palindromic repeat (CRISPR) 
loci and their CRISPR-associated (cas) genes function together as a pro- 
karyotic adaptive immune system which can protect bacteria and archaea 
from invading genetic elements such as viruses (phages)**. Repeat ele- 
ments of the CRISPR locus are intercalated with short ‘spacer’ sequences 
which typically match phage or plasmid genomes and dictate the targets 
for immunity on the basis of sequence identity. Active immunity requires 
transcription of the CRISPR locus, followed by cleavage of the transcript 
within repeat sequences by Cas endoribonucleases”*. This liberates small 
CRISPR RNAs (crRNAs) which specify the target for RNA-guided Cas 
nucleases that defend the cell from infection by degrading invading 
genomes’. Furthermore, CRISPR-Cas systems can behave in an adap- 
tive fashion through what appears to be direct acquisition of new spacer 
sequences from invading elements, thereby conferring sequence-specific, 
heritable immunity. On the basis of cas gene content and organization, 
CRISPR-Cas systems have been classified into three main types (I-III) 
and at least 12 subtypes (A-F)’. A growing body of work indicates a com- 
mon role for these systems in antiviral defence, but the diversity of ccRNA 
processing and targeting mechanisms”? that have been described sug- 
gest the potential for differences in their precise functions in vivo. 

It is well established that CRISPR-Cas systems can tolerate ‘self spacer 
elements within the CRISPR locus DNA via sequence discrimination 
at the flanking repeats. For type I and type II systems, this requires that 
short sequences which license targeting, known as protosopacer adjacent 
motifs, are absent from the repeat sequences flanking each spacer’®"'. For 
type III systems, targeting is prevented by excessive base pairing between 
the repeat-derived crRNA tag and its corresponding DNA sequence”. 
Tolerance to ‘non-self DNA elements, on the other hand, has yet to be 
described. Previous reports indicate that active CRISPR-Cas systems and 
their targets cannot co-exist in the same cell®’*”*. Thus, CRISPR-Cas tar- 
geting that relies strictly on DNA sequence recognition does not offer 
the flexibility to accommodate genetic elements with ambiguous fitness 


costs, such as temperate phages. Upon infection, temperate phages can 
kill the host cell by initiating a lytic cycle, but they may also spare the cell 
from lysis and establish a lysogenic cycle, typically via repression of lytic 
genes and integration into the host chromosome as a so-called prophage”. 
In addition to preventing lysis, lysogenization can result in a variety 
of phenotypic outcomes which can improve host fitness, for example 
via expression of non-viral ‘moron’ genes carried on temperate phage 
genomes*”. The lysis/lysogeny decision is generally governed by a cen- 
tral promoter region which responds to stochastic and environmental 
factors to control transcription in divergent directions, thereby promot- 
ing one or the other infection cycle!’. Under certain conditions, the pro- 
phage can re-initiate a lytic cycle and excise from the chromosome—a 
process referred to as prophage induction. Commitment to either the lytic 
or lysogenic cycle does not involve changes in the viral genome sequence. 
Hence, it is generally accepted that CRISPR-Cas targeting of temperate 
phages should exclude both infection outcomes; in addition to prevent- 
ing lysis, CRISPR attack ofan integrated prophage target precludes stable 
lysogenization. Although this appears to be the case for type I-E (ref. 6) 
and II-A (ref. 16) CRISPR-Cas systems, the potential for tolerance dur- 
ing type III immunity had not been explored. 

To investigate the behaviour of type III CRISPR immunity during 
temperate phage infection, we introduced pGG3, a plasmid carrying the 
type III-A CRISPR-Cas system of S. epidermidis RP62a (ref. 17), into 
Staphylococcus aureus RN4220 (ref. 18). This strain is sensitive to the 
lambda-like temperate phages of S. aureus Newman, a clinical isolate 
harbouring four heteroimmune prophages (DNM1-4) which carry genes 
that enhance the pathogenicity of their host’’. We also identified a spacer 
in one of the CRISPR loci of S. aureus MSHR1132 (ref. 20) with near- 
perfect identity to a conserved target sequence present in DNM1 (Fig. 1a), 
@NM2 and ONM4. This spacer, referred to as 32T (Supplementary 
Table 1), was added to the CRISPR locus of pGG3. Using DNM1, we 
first established that this spacer prevents lytic infection by showing that 
plaquing efficiency is reduced approximately seven orders of magnitude 
compared with a strain carrying the pGG3 plasmid without the DNM1- 
targeting spacer (Fig. 1b). We then introduced an erythromycin resis- 
tance gene (ermC) into DNM1 to facilitate quantification of lysogens 
that have stably integrated a chromosomal prophage (creating DNM1- 
Erm®*). Using this system, we expected to find results consistent with a 
report describing CRISPR-mediated immunity to lysogenization by phage 
lambda in E. coli®. Surprisingly, we obtained the same efficiency oflysog- 
enization compared with the control strain lacking spacer 32T (Fig. 1c). 
To test whether the presence of mismatches between the 32T crRNA and 
its target was influencing this phenomenon, we engineered spacer 32T* 
with a perfect match to its target, but obtained the same results (Fig. 1b, c). 
We next sought to determine whether genetic CRISPR-Cas inactivation 
is responsible for the apparent tolerance of these lysogens by testing them 
for sensitivity to DNM2. All 14 clones maintained resistance to DNM2 
mediated by spacer 32T (Extended Data Fig. 1a, d). Finally, we demon- 
strated that spacer 32T tolerance does not result from genetic alteration 
of the target phage (Extended Data Fig. 1b, c). Tolerance was also observed 
for DNM4 (Extended Data Fig. 2), demonstrating that the tolerance phe- 
nomenon is not specific for the ONM1-Erm® phage or its integration 
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Figure 1 | Type III-A CRISPR immunity can block lytic infection but 
tolerate lysogenization. a, Base pairing interaction between crRNA 32T 
and its target in the DNM1 genome (highlighted in grey). The crRNA tag is 
a sequence transcribed from the CRISPR repeat that needs to be unpaired 
with the flanking region of the target to license immunity. The target gene is 
transcribed from left to right. b, CRISPR immunity against DNM1 infection 
provided by spacers 32T and 32T* (similar to 32T but without mismatches), 
measured as a decrease in the number of plaque-forming units (p.f-u.) with 
respect to the non-targeting control pGG3 (C). c, Lysogenization with 
@®NM1-Erm‘* in the presence of spacers 32T and 32T* or the pGG3 control (C), 
measured as the number of chloramphenicol- and erythromycin-resistant 
colony-forming units (c.f.u.) per millilitre obtained after infection. Control 
cells lysogenized with ®NM1 (C’) lack the ermC insertion and do not 

yield erythromycin-resistant colony-forming units. Error bars, mean + s.d. 
(n = 3). 


locus. These results demonstrate that type III-A CRISPR immunity can 
block lytic infection but tolerate lysogenization without concomitant 
genetic CRISPR-Cas inactivation or alteration of the phage genome. 
To determine whether prophage tolerance is a spacer-specific phenom- 
enon, we designed a variety of spacers with 100% target identity, target- 
ing different regions of the PNM1 genome on both strands (Fig. 2a). We 
first tested the ability of each spacer to prevent lytic infection (Fig. 2b). 
Surprisingly, spacer functionality varied with the predicted transcriptional 
context of each target sequence. Spacers matching putative lytic genes 
to the right of the central promoter which are predicted to be unidirec- 
tionally transcribed were only effective when they targeted the predicted 
non-template strand (top strand according to our spacer nomenclature). 
Meanwhile, transcription is predicted to be bi-directional to the left of 
the central promoter’’. Spacers targeting this region prevented plaque 
formation regardless of the strand targeted. This resembled the activity 
reported for the type III-B CRISPR-Cas system of the archaeon Sulfolobus 
islandicus REY15A, where immunity to plasmid transformation depen- 
ded on the presence of promoters flanking a target sequence”. We thus 
reasoned that transcription-dependent targeting could explain the dis- 
crepancies in spacer functionality. Indeed, DNM1 transcription profiles 
assessed by RNA sequencing of RN4220 cultures 6, 15, 30 and 45 min 
after infection revealed predominantly unidirectional transcription to 
the right of the central promoter, while bi-directional transcription was 
detected to the left of the central promoter (Extended Data Fig. 3). 
Further evidence for the transcription-dependence of type III-A CRISPR- 
Cas targeting was obtained via the characterization of a spacer 2B CRISPR- 
escape mutant phage, ONM1y6, exhibiting a clear plaque phenotype 
characteristic of phages that cannot establish lysogeny (Fig. 2c, inset). 
Sanger sequencing of the spacer 2B target sequence did not reveal any 
mutations in the target or flanking sequences (data not shown); thus, we 
measured the DNM1Y6 plaquing efficiency with other spacers to deter- 
mine whether it possessed a sequence-independent, general CRISPR- 
escape phenotype (Fig. 2c). Although most spacers provided immunity 
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Figure 2 | Transcription of target sequences is required for type III-A 
CRISPR immunity. a, The DNM1 genome and the position of targets used in 
this study. T, crRNA anneals to the top strand; B, bottom strand. Grey arrows 
represent the ®NM1 central promoter driving divergent transcription. 

b, Immunity against DNM1 infection provided by spacers targeting the phage 
regions shown in a. Dotted line indicates the limit of detection for the assay. 
c, Immunity against DNM1Y6 infection. Inset, comparison of lysis phenotypes 
for DNM1 (turbid) and DNM1y6 (clear), representative of four technical 
replicates. d, Leftward promoter consensus sequences at the DNM1 and 
@®NM1Y6 central promoter. The DNM1y6 mutation in the —10 element is 
shown in red. The putative transcription start site is noted (+1). e, Comparison 
of phage transcription profiles from cells infected with ®NM1 (grey line) or 
@®NM176 (red line), 15 min post-infection. Phage-derived transcripts are 
plotted in reads per million total-mapped reads relative to their position on the 
genome; arrows indicate the direction of transcription plotted in each graph; 
the vertical dotted line marks the position of the central promoter. Error bars, 
mean + s.d. (n = 3). 


against DNM1Y6, we identified one additional spacer, 4B, which was 
escaped by the mutant phage. Both the 2B and 4B spacers target the same 
strand in the lysogenization operon to the left of the central promoter. 
Importantly, the two complementary spacers (2T and 4T) targeting the 
opposite strand of spacers 2B and 4B were not escaped by ONM1Y6, indi- 
cating that the 2B/4B escape phenotype did not result from changes to 
the target DNA per se. Consistent with this, we did not observe differ- 
ences in the DNM1 and ®NM176 plaquing efficiency when targeting 
the 4B region via Cas9-mediated type II-A CRISPRimmunity (Extended 
Data Fig. 4a), which was shown to cleave double-stranded DNA even in 
the absence of target transcription’. We thus reasoned that the DNM1y6 
type III-A CRISPR-escape and clear-plaque phenotypes could result from 
a localized, unidirectional reduction in transcription, for example, left- 
ward from the central promoters. Indeed, de novo sequencing of DNM1y6 
revealed a single nucleotide polymorphism in a crucial residue of the 
leftward — 10 element (Fig. 2d) of the central promoters, immediately 
upstream of the SAPPVI_g4 cI-like repressor gene required for lyso- 
genic establishment, and ~ 1700 base pairs (bp) away from the 2B target 
sequence. Encouraged by this result, we directly assessed DNM176 tran- 
scription profiles using RNA sequencing, 6 and 15 min after infection 
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(Extended Data Fig. 5a). Consistent with our hypothesis, leftward tran- 
scription (Fig. 2e, lower panel) of the lysogenization operon 15 min after 
infection was strongly reduced, while rightward transcription (Fig. 2e, 
upper panel) in this region was relatively unchanged. Taken together, 
these findings suggest that transcription across target sequences is a 
requirement for type III-A CRISPR immunity. Previously reported** 
strand-independent immunity against plasmids in S. epidermidis may 
also follow this rule, as bi-directional transcription was detected across 
targets (Extended Data Fig. 6). 

Given that temperate phages silence transcription of their lytic genes 
during lysogeny"’, we hypothesized that transcription-dependent target- 
ing would allow a variety of otherwise effective spacers to tolerate pro- 
phage target sequences. The corollary to this prediction is that targets 
which are constitutively transcribed during lysogeny (for example, left- 
ward from the central promoter; Extended Data Fig. 5b) should not be 
tolerated. Five additional spacers were tested for their ability to tolerate 
lysogenization by ONM1-Erm‘* (Fig. 3a). As expected, lysogenization 
was tolerated by spacers 17T, 43T and 61T-1 targeting lytic genes to the 
right of the central promoter, but not by spacers 2B and 17B targeting 
genes constitutively expressed leftwards during lysogeny. Consistent with 
the notion that type III-A tolerance results from differences in transcrip- 
tion at the target, we did not observe tolerance with the transcription- 
independent type II-A CRISPR-Cas system, even when targeting a lytic 
gene (Extended Data Fig. 4b-h). To rule out the possibility that type III-A 
tolerance is influenced by processes that occur during phage infection, 
we corroborated these results using a ‘reverse’ CRISPR immunity assay 
by electroporating CRISPR plasmids into pre-existing lysogens. In this 
experiment, transformation with a targeting CRISPR-Cas system results 
in attack of chromosomal prophage sequences and an inability to form 
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Figure 3 | Conditional tolerance is achieved via transcription-dependent 
CRISPR-Cas targeting. a, DNM1-Erm® lysogenization for additional spacers. 
C, pGG3 non-targeting control. b, Transformation of ©NM1-lysogenic 
competent cells with CRISPR-Cas plasmids containing different spacers 
(transformation efficiency is measured as colony-forming units per 
nanogram of plasmid DNA). C, pGG3 non-targeting control. c, Integration 
of the 43T/B ®NM1 target region into the chromosome of S. aureus. Target 
sequences (inserted in both forward and reverse orientations) are under the 
control of the tetracycline-inducible promoter Pye. The 43T/B crRNAs 
are shown annealing to either the top or bottom strands. d, Transformation of 
both strains shown in c, as well as an isogenic control strain lacking the target 
insertion, with CRISPR-Cas plasmids containing spacers 43T or 43B. 
Transformants were plated on selective plates with or without ATc for 
induction of the P,yiree* promoter. e, Growth curve of strains shown in 

d expressing the spacer 43T CRISPR-Cas system, in the presence or absence of 
ATc addition at the indicated time point (black arrow). Error bars, mean = s.d. 
(n = 3). 
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colonies'*. Again, results varied with the transcriptional context of target 
sequences in a manner consistent with our previous findings (Fig. 3b). 
Collectively, these results demonstrate that prophages are not intrinsi- 
cally tolerated during CRISPR immunity in staphylococci, and suggest 
rather that type III-A tolerance is only achieved during lysogeny under 
the condition that transcription is silenced at target sequences. 

To demonstrate definitively that transcription-dependent targeting 
offers a biological mechanism for conditional tolerance, we integrated 
the DNM1 target sequence for spacers 43T and 43B into the chromosome 
of S. aureus RN4220 under the control ofa tightly regulated tetracycline- 
inducible promoter, thus emulating target lysogenization. The target was 
placed in both orientations with respect to the inducible promoter (Fig. 3c) 
and with respect to the chromosomal origin of replication (Extended 
Data Fig. 7a). The resulting strains were then transformed with the spacer 
43T or 43B plasmids in a reverse CRISPR immunity assay, and plated 
in the absence or presence of the inducer. CRISPR immunity was only 
achieved when transcription across the target was induced with anhy- 
drotetracycline (ATc) in the presence of an antisense crRNA, regard- 
less of the target’s orientation (Fig. 3d and Extended Data Fig. 7a). Once 
again, we confirmed this finding to be a type-III-specific phenomenon 
by transforming the strains from Fig. 3c with the spacer 43B-tII type II- 
A CRISPR plasmid targeting the same region (Extended Data Fig. 7b). 
We corroborated this result by following the growth of spacer 43T trans- 
formants in liquid media (Fig. 3e). Upon addition of the inducer, growth 
was only inhibited for cells with the target in the forward orientation 
for which spacer 43T produces an antisense crRNA. Importantly, tol- 
erance achieved in the absence of the inducer did not appear to affect 
growth (Fig. 3e, dotted lines). Finally, having established that type III-A 
CRISPR-Cas systems can block lytic infection but tolerate lysogeniza- 
tion, we examined the effect of tolerant spacers on prophage induction 
of DNM1 lysogens in culture. Compared with a spacerless lysogen con- 
trol, the phage titre resulting from spontaneous induction of overnight 
cultures was significantly lower for lysogens harbouring a tolerant spacer 
(Fig. 4a). We next followed the growth of cultures induced directly with 
the DNA-damaging agent mitomycin C (Fig. 4b, solid lines). While the 
spacerless lysogen control cultures succumbed to prophage induction, 
the presence of a tolerant spacer prevented lysis. 

We report here that type III-A immunity can offer conditional toler- 
ance to ‘non-self genetic elements, in this case, temperate phages. This has 
several important implications for the CRISPR-Cas system and its host 
population. Tolerance helps ensure the genetic stability of the CRISPR- 
Cas system, since selective pressure to integrate prophages in the pres- 
ence of intolerant spacers can drive genetic CRISPR-Cas inactivation 
(Extended Data Fig. 4c-e)—similar to what occurs during plasmid uptake”. 
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Figure 4 | Prophage induction is limited by type III-A CRISPR immunity in 
tolerant lysogens. a, Plaque-forming potential (measured in plaque-forming 
units per millilitre) of supernatants from overnight cultures of ONM1 lysogens 
carrying the tolerant spacer 32T or 43T CRISPR plasmids, or the pGG3 
non-targeting control (C). b, Growth curve of PNM1-Erm* lysogens or a 
non-lysogen control harbouring the pGG3 or spacer 43T CRISPR plasmids as 
indicated, with or without the addition of the prophage-inducing agent 
mitomycin C (MMC) at the indicated time point (black arrow). Error bars, 
mean = s.d. (n = 3). 
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In other words, tolerant spacers ensure that a population can sample 
potentially beneficial phenotypes that result from prophage integration 
without compromising their CRISPR-mediated immunity. Tolerance 
may also be particularly vital for type III systems, which were recently 
shown to provide immunity despite up to 15 mismatches with their spacer”. 
Thus, without the potential for phages to evade targeting readily via 
point mutation that is seen for type I and type II systems'*"’, the tran- 
scriptional dependence of type III-A targeting offers temperate phages 
an alternative route to lysogenization that need not provide selection 
for mutants. Furthermore, tolerant lysogens had the added potential 
of resisting lysis via prophage induction. Although a few spacers target- 
ing the lysogenization functions did not provide tolerance, it is impor- 
tant to note that these genes only constitute a small portion of the phage 
genome. Hence, spacers targeting this region should be acquired less 
frequently, even if spacer acquisition occurs randomly without an addi- 
tional mechanism for distinguishing tolerant from intolerant spacers 
during acquisition. 

The requirement for transcription across target sequences during type 
III-A immunity contrasts with the transcription-independent targeting 
reported for type I (ref. 27) and type II (refs 22, 23) CRISPR-Cas systems. 
Given the temporal pattern of target transcription observed during the 
phage lytic cycle, it might be expected that CRISPR targeting of late genes 
would not provide immunity if the cell’s survival is already compromised 
at the onset of targeting. Indeed, we observed some differences in spacer 
effectiveness when infecting cells in liquid culture at very high multiplicity 
of infection (MOI; ~ 100): spacers targeting late genes were less protec- 
tive (Extended Data Fig. 8). However, this effect was not pronounced at 
an MOTof 10, suggesting that the system is generally robust to delays in 
target transcription, in accordance with what we observed in efficiency 
of plaquing assays. Consistent with our findings at large, our survey of 
sequenced staphylococcal type III spacers showed that naturally acquired 
spacers with known target sequences produced crRNAs complemen- 
tary to the non-template strand of predicted ORFs in nine out of ten 
cases (Supplementary Table 2). This bias suggests negative selection on 
non-functional spacers targeting template strands. Alternatively, type 
III systems may utilize an unknown mechanism to discriminate tem- 
plate and non-template strands during spacer acquisition. The molec- 
ular details of the transcription-dependent targeting mechanism remain 
unclear. Preliminary experiments indicate that the presence of a tran- 
script provided in trans is not sufficient to license DNA targeting (Ex- 
tended Data Fig. 9). Consistent with this, induction of transcription across 
a plasmid-borne target results specifically in loss of the targeted plasmid 
(Extended Data Fig. 10). Hence, transcription in cis is probably required 
for DNA targeting. One possibility is that negative supercoiling gener- 
ated in the wake of a passing transcription bubble could facilitate target 
DNA melting and improve crRNA recognition’’. This hypothesis, how- 
ever, would not account for the template and non-template strand asym- 
metry observed in our system. Another possibility is that exposure of the 
target non-template strand within the transcription bubble is required 
for annealing of a crRNA”". In this scenario, the observed asymmetry 
might be explained by occlusion of base-pairing to the template strand 
by either the nascent transcript or the RNA polymerase™*. However, given 
the small size of the transcription bubble, this explanation may also be 
incomplete. Alternatively, transcription may be required to activate a 
targeting mechanism rather than facilitating target recognition or bind- 
ing per se. In this case, effective targeting could require base-pairing 
potential between the crRNA and the nascent transcript in cis, which 
would be absent for crRNAs with complementarity to the template strand. 
Previous reports”””° have revealed specific RNA target cleavage activity 
for type III(-B/C) systems that could be relevant to the transcription- 
dependent DNA targeting observed in staphylococci. Further experi- 
mentation will be needed to clarify these points. In summary, our work 
expands the repertoire of CRISPR-based immune functions to include 
anovel capacity for conditional tolerance of foreign elements, and estab- 
lishes distinct genetic outcomes resulting from immunity to temperate 
phages via divergent CRISPR-Cas targeting mechanisms. 
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METHODS 

Bacterial strains and growth conditions. Cultivation of S. aureus RN4220 (ref. 18), 
TB4 (ref. 19) and derivative strains was done in TSB media (BD) at 37 °C, except 
when phage infections were performed, or when otherwise noted (see below). When- 
ever applicable, media were supplemented with chloramphenicol at 10 ug ml ' to 
ensure CRISPR plasmid maintenance. RN4220 strains harbouring pCL55-derived 
insertion vectors were grown similarly, but kanamycin was provided at 25 jg ml 
except during re-culture for competent cell preparation. E. coli DH5a was grown in 
LB Broth (BD) supplemented with kanamycin at 25-50 pg ml“! to maintain pCL55- 
derived plasmids. Selection for ONM1-Erm lysogens with resistance to erythro- 
mycin (10 jig ml ') was only applied during the lysogenization protocol as described 
below, and, where applicable, during the subsequent ®NM2 sensitivity assays. 
Estimation of phage lysate titres. Serial dilutions were prepared in triplicate and 
plated on soft agar lawns of RN4220 in HIB-agar (BD) supplemented with 5 mM 
CaCl, (technical replicates). Plates were incubated at 37 °C for 16-24 h after drying 
at room temperature (25 °C). 

DNA preparation and cloning. Plasmid DNA was purified from 2 to 6 ml of E. 
coli DH5a or S. aureus RN4220 overnight cultures. For preparation from S. aureus 
cultures, cells were pelleted, re-suspended in 100 pl TSM buffer (50 mM Tris-HCl 
pH7.5, 10 mM MgCl, 0.5 M sucrose) then treated with 5 pil lysostaphin (2 mg ml") 
at 37 °C for 1.5h before treatment with plasmid miniprep reagents from Qiagen. 
Purification used Qiagen or EconoSpin columns. 

Cloning used RN4220 electrocompetent cells unless otherwise stated. For most 
type III CRISPR plasmids, scarless addition of repeat-spacer units to the pGG3 par- 
ent vector was accomplished by ‘round-the-horn’ PCR (ref. 31) followed by blunt 
ligation, using common primer oGG12 and spacer-specific oligonucleotides listed 
in Supplementary Table 3. The pGG3 vector was itself constructed by ‘round-the- 
horn’ PCR using primers L55 and A10 to remove extraneous repeat-spacer elements 
from the pWJ308 (ref. 32) CRISPR array. For construction of the remaining type III 
CRISPR plasmids, a modified parent vector (pGG3-Bsal) was created by introduc- 
ing a placeholder spacer harbouring two Bsal restriction sites, to facilitate scarless 
cloning of spacers by replacement with annealed oligonucleotide pairs possessing 
Bsal-compatible overhangs. Type III-A CRISPR arrays were amplified with primers 
L50/L6 and sequenced by Sanger using either forward or reverse primers. The Bsal 
cloning method was also used to construct type II CRISPR plasmids from the pDB184 
parent vector, a modified version of pW]40 with only the single placeholder spacer. 
The pC194-derived pWJ40 vector contained the full S. pyogenes MIGAS type II 
CRISPR-Cas system and was constructed by amplifying S. pyogenes genomic DNA 
with oligonucleotides L362/W278 and pC194 with oligonucleotides W270/W282, 
followed by digestion of the PCR products with BglII and BssSI and a subsequent 
ligation. Type II CRISPR arrays were amplified with primers L448 and W176, and 
sequenced by Sanger using L448. After the cloning of each spacer, plasmid sizes were 
verified by restriction digest with BssSI for type III plasmids or Btgl for type II plas- 
mids. pDB184 was created via Gibson assembly of two PCR fragments: a pWJ40 
backbone amplified using primers B220/B334, and a CRISPR array amplified from 
pCas9 (ref. 14) using primers L448/B333. 

For construction of pCL55-derived inducible target vectors, cloning used chem- 
ically competent DH5o cells. Briefly, the chloramphenicol resistance cassette was 
first replaced with a kanamycin resistance cassette amplified from strep LAM202- 
3 using primers L484/L485. This was accomplished by ‘round-the-horn’ PCR on 
the pCL55-iTET parent vector using primers L482/L483, followed by blunt liga- 
tion with the PCR-amplified resistance cassette to create the new pKL55-iTET-B 
parent vector. Directionality of the insertion was verified afterwards by restriction 
digest with Btgl. Modification of the P,yi/tet promoter in accordance with pRAB12 
(ref. 33) architecture was achieved via two consecutive overlap PCR steps to intro- 
duce point mutations using oligonucleotide pairs oGG108/0GG109 and oGG110/ 
0GG111, followed by a ‘round-the-horn’ PCR step and blunt ligation to introduce 
the downstream operator sequence using oligonucleotides oGG112 and oGG113. 
The resulting pXKL55-iTET-RC12 vector habouring the P,.y1/re¢* modifications was 
used for downstream manipulations, as well as integration into the RN4220 chro- 
mosome to create the “Targetless’ control strain. For forward and reverse target inser- 
tions, annealed oligonucleotide pairs (0GG124/0GG125 and 0GG126/0GG127, 
respectively) with appropriate overhangs were ligated into the multiple cloning site 
after digesting the vector with BglII and SaclI restriction enzymes (NEB). Target 
insertions were verified by PCR amplification and Sanger sequencing using the primers 
oGG64 and oGG838. Inversion of the attP motif for both forward and reverse target 
vectors was achieved by Gibson assembly of two PCR fragments, using oligonucle- 
otides oGG102/0GG103 for the attP motif and 0GG104/0GG105 for the backbone. 
Directional integration into the RN4220 chromosome was verified by amplifica- 
tion of either the attL or attR junctions using primer pairs oGG50/oGG96 and 
0GG51/0GG96, respectively. The pWJ153-inducible target vector is a pKL55-iTET- 
RC12- and pE194- (ref. 34) derived plasmid constructed via multiple steps of either 


‘round-the-horn PCR" followed by blunt ligation or Gibson assembly. The full 
sequence is provided as Supplementary Sequence 1. 

Construction of the PNM1-Erm*® lysogen was achieved via pKOR allelic exchange™. 
Homology arms (~1 kb) were amplified from the chromosome of S. aureus RN4220:: 
®NM1 using primer pairs oGG181/0GG182 and oGG185/0GG186, while the ~1.25 kb 
ermC resistance cassette was amplified from a pE194 plasmid preparation using 
primers oGG183 and oGG184. An ~3.25 kb fragment was assembled by SOEing 
PCR* using external primers oGG181 and oGG186 with clonase (QuikChange) attB 
adapters that allowed directional integration into the pKOR vector**. Sequence integ- 
rity of the ~3.25 kb insertion was verified by Sanger using primers L29, oGG191, 
0GG192, W277 and L325. 

Preparation of electrocompetent S. aureus cells. S. aureus RN4220, TB4 or deriv- 
ative strains were grown overnight in TSB medium, diluted 1:100 in fresh medium 
without antibiotics, then allowed to grow to an attenuance (Doo nm) reading of 0.8- 
1.0 for RN4220 or 0.7-0.9 for TB4. Measurements were taken using a NanoDrop 
2000c Spectrophotometer (Thermo Scientific) and disposable polystyrene cuvettes. 
After re-culture, cells were pelleted at 4 °C, and two or three washes were performed 
using chilled, sterile dHO or 10% glycerol. Cells were ultimately re-suspended in 
1/100th volume of chilled, sterile 10% glyercol and 50 il aliquots were distributed 
for storage at —80 °C. 

Efficiency of plaquing assays. High-titre lysates (~10'* p.fu. ml‘) of either 
@®NM1, ODNM1y6 or ®NM2 were serially diluted in triplicate and applied to soft 
agar lawns of RN4220 strains harbouring CRISPR plasmids, including pGG3 or 
pDB184 spacerless control lawns infected in parallel (technical replicates). Plates 
were incubated at 37 °C for 18 h. After incubation, plates were monitored at bench 
top for up to 24h to facilitate quantification of plaque-forming units. 
Quantification of erythromycin-resistant lysogens. Overnight cultures of RN4220 
with respective CRISPR plasmids were inoculated in triplicate from single colonies 
in HIB medium supplemented with chloramphenicol (biological replicates). After 
chilling at 4 °C, 1:10 dilutions were prepared in 1 ml fresh HIB supplemented with 
chloramphenicol and 5mM CaCl. Diluted cultures were infected with DNM1- 
Erm at ~MOI 10 and incubated on ice for 30 min. After incubation on ice, cultures 
were transferred to a 37 °C incubator for 30 min with shaking. Serial dilutions from 
each culture were then applied to HIB-agar plates supplemented with chloramphen- 
icol, erythromycin and 5 mM CaCl, for quantification of lysogenic colony-forming 
units. In selected cases, type III-A CRISPR locus and target sequence integrity was 
verified by colony PCR after re-streaking single colonies using primer pairs L6/L50 
(CRISPR array) and oGG25/0GG26 (ORF 2) or oGG38/0GG39 (ORF 32). Where 
applicable, Sanger sequencing of PCR products was also performed using these 
primers. When verifying type II lysogenization isolates, the spacer 43B-tII target 
region was amplified using primers oGG233 and oGG234, and the type II CRISPR 
array was amplified using L448 and W176. The presence of integrated ®NM1 or 
@NM1-Erm™ prophages was confirmed by colony PCR using primer pairs oGG191/ 
W277 and oGG206/W276 to amplify the attL and attR junctions, respectively. To 
estimate the total number of recipient cells, serial dilutions of untreated overnight 
cultures were plated on TSB- or HIB-agar supplemented with chloramphenicol. 
@NM2-sensitivity assay. High-titre lysate of PNM2 (~10"? p.fu. ml‘) was applied 
to the surface of a pre-dried HIB-agar plate supplemented with 5 mM CaCl, and 
appropriate antibiotics, then allowed to dry for an additional ~30 min at room tem- 
perature. Single colonies isolated from Erm lysogeny experiments or CRISPR plas- 
mid transformations were streaked through the DNM2-seeded region using a sterile 
plastic loop then incubated for ~12h at 37 °C. 

Enumeration of plaque-forming units liberated from lysogen cultures. Over- 
night cultures of either RN4220::0DNM1-Erm or RN4220::DNM1 lysogens harbour- 
ing targeting CRISPR plasmids or non-targeting control plasmids were inoculated in 
triplicate from single colonies in HIB media supplemented with chloramphenicol 
(biological replicates). After overnight growth, cells were transferred to 4°C then 
pelleted by centrifugation at 4696g for 5 min. Supernatants were filtered, and 100 pl 
from each lysate was mixed with 100 ll of either an indicator strain or targeting 
strain overnight culture for plating by the soft agar method. After drying at room 
temperature (25 °C), plates were incubated 18 h at 37 °C. 

Screen for lipase-negative DNM4 lysogens. An overnight culture of S. aureus TB4 
harbouring the spacer 32T CRISPR plasmid was re-cultured to log phase growth in 
HIB medium supplemented with 5 mM CaCl). After measurement of Dgoo nm cells 
were treated with DNM4 at ~MOI50. After incubation for 1 h, cells were plated on 
TSA supplemented with 5% egg yolk emulsion. After ~24h incubation at 37 °C, 
approximately 1,000 colonies were inspected for lipase secretion. Two lipase-negative 
candidates were re-streaked to single colonies, and the presence of an integrated 
@NM4 prophage was confirmed by colony PCR using primers oGG50 and oGG96 
to amplify the attL junction. 

Phage DNA isolation and deep sequencing. Samples of high titre phage lysates 
(~10” p-f.u. ml ') were treated with DNase and RNase toa final volume of 150 pl 
for 1h at 37 °C. Samples were treated with EDTA (pH 8.0) toa final concentration 
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of 20 mM, followed by treatment with SDS to a final concentration of 0.5% and 2 pl 
proteinase K. Samples were incubated for 1 h at 65 °C, then subjected to a PCR puri- 
fication protocol (Qiagen). Paired-end library preparation was performed on puri- 
fied phage DNA using a Nextera Tagmentation protocol (Illumina), and samples 
were pooled for multiplexed sequencing on a MiSeq (Illumina). De novo assembly 
of phage genomes used ABySS”’. 

RNA preparation for reverse transcription PCR and RNA sequencing. For 
reverse transcription PCR (RT-PCR), overnight cultures were diluted 1:20 in 25 ml 
fresh media and grown for 2.5 h at 37 °C with shaking. After re-culture, cells were 
pelleted and washed twice in 1 ml ice cold TSM buffer, then treated with 3 jl lyso- 
staphin (2 mg ml‘) for 20 min at 37 °C in 500 jl TSM buffer. Treated cells were 
pelleted then re-suspended in 750 1l cold TRIzol Reagent (Life Technologies) after 
discarding the supernatant. The following chloroform extraction and precipitation 
was performed according to the manufacturer’s protocol. After resuspension in dH20, 
samples were treated with Qiagen DNase I for 45 min at 30 °C, then re-purified using 
RNeasy Cleanup columns (Qiagen). In some cases, it was necessary to repeat this 
step a second time to ensure the complete removal of DNA. After cleanup, all sam- 
ples were again treated with DNase I (Sigma-Aldrich) for 30-45 min at 30 °C, before 
use in the reverse transcription reaction. 

For RNA sequencing, overnight cultures were diluted 1:100 in fresh HIB supple- 
mented with chloramphenicol and 500 LM CaCl, and grown for 1.5 h (approximately 
mid-log phase) at 37 °C with shaking. Cultures were removed, infected at MOI ~ 20 
then split into 10 ml portions for an additional 6, 15, 30 or 45 min of growth. Imme- 
diately following incubation, samples were mixed with 10 ml ofa 1:1 acetone/ethanol 
solution and transferred to —80 °C. The ®NM1 lysogen was grown similarly, except 
without antibiotics, and harvested immediately after the 1.5 h re-culture at 37 °C. 
After at least one overnight at —80 °C, samples were thawed on ice and pelleted by 
centrifugation at 4696g for 10 min. After two washes of 1 ml TE buffer, cells were 
re-suspended in 1 ml RLT buffer (Qiagen) supplemented with BME, and transferred 
to 2-ml tubes pre-loaded with ~0.5-1 cm’ of 0.1 mm glass beads (BioSpec). Samples 
were processed in a Mini-Beadbeater instrument (BioSpec) three times for 10s at 
4,200 oscillations per minute, with 40 s of chilling on ice between runs. After bead- 
beating, samples were spun down for 2 min at 16,100g in a refrigerated microcen- 
trifuge. Supernatant (750 ul) was transferred to a clean tube for mixing with 500 pl 
of 100% ethanol, and the following RNeasy purification was done according to the 
manufacturer’s protocol (Qiagen). After elution, samples were treated with either 
Qiagen or Sigma-Aldrich DNase I for 30-45 min at 30 °C, then re-purified using 
RNeasy cleanup columns. In some cases, it was necessary to repeat this step a second 
time to ensure the complete removal of DNA. rRNA-depleted samples were sub- 
sequently generated using the RiboZero Magnetic Kit for bacteria (Epicentre), accord- 
ing to the manufacturer’s protocol. 

RT-PCR. Reverse transcription used M-Mulv Reverse Transcriptase (NEB), with 
DNA-free total RNA isolated from RN4220 cultures harbouring either the pNes(wt-d) 
or pNes(wt-i) plasmids as templates for cDNA synthesis. For pNes(wt-d), reverse 
transcription used either the L8 or L86 primers in two separate 30 ll reactions, 
alongside mock reactions (-RT enzyme). For pNes(wt-i), the same was done using 
primers L8 or L87. After incubation, 1 1] of each reaction was used as a template for 
PCR, with respective primer pairs for each sample. 

Phage transcriptome analysis and visualization. Reads were aligned to reference 
genomes using Bowtie and sorted using Samtools. Using a custom script, sorted reads 
were accessed via Pysam, normalized as reads per million values, and plotted in log 
scale as the average over consecutive windows of 500 base pairs using matplotlib 
tools for [Python. 

Transformation assays. S. aureus RN4220 plasmid preparations were dialysed on 
0.025 jum nitrocellulose filters (Millipore) then quantified using a NanoDrop 2000c 
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Spectrophotometer (Thermo Scientific). Aliquots (50 il) of electrocompetent cells 
were transformed in triplicate with 80 ng dialysed DNA per transformation using a 
GenePulser Xcell (BioRad) with the following parameters: 2900 V, 25 LF, 100Q, 
2mm (technical replicates). After electroporation, cells were immediately re-suspended 
in TSB toa final volume of 200 pil and recovered at 30 °C for 2 h with shaking. Serial 
dilutions were then prepared before plating with appropriate antibiotics. For reverse 
CRISPR immunity assays targeting insertion vectors, additional plating in the pres- 
ence of ATc ata final concentration of 0.5 tg ml‘ was performed in parallel using 
the same dilutions. Plates were incubated at 37 °C for 18-24h. 

Plate reader growth curves. For ATc induction experiments, overnight cultures 
were launched from single colonies in triplicate and diluted 1:200 in TSB broth (bio- 
logical replicates). After 1h of growth, ATc was added at a final concentration of 
0.5 tg ml’ where applicable. Measurements were taken every 5 min. For mitomy- 
cin C induction experiments, overnight cultures were launched from single colonies 
in duplicate and diluted 1:100 in HIB broth (biological replicates). After 1.5 h of growth, 
mitomycin C was added at a final concentration of 0.5 tg ml ' where applicable. 
Measurements were taken every 10 min. For ®NM1 infections, overnight cultures 
were launched from single colonies in triplicate and diluted 1:100 in HIB broth 
supplemented with CaCl, 5 mM (biological replicates). After 1 h 25 min of growth, 
Deo nm Was measured for three representative cultures to estimate MOI. Aliquots 
were then loaded into 96-well plates along with DNM1 at the appropriate MOI (10 
or 100), where applicable. Measurements were taken every 5 min. For ONM1y6 
infections, overnight cultures were launched from single colonies in triplicate and 
diluted 1:200 in HIB supplemented with CaCl, 5 mM (biological replicates). An 
average Deoo nm Was measured after 1 h of growth, and ®NM1y6 was added at an 
MOI of 10 on the basis of this value, where applicable. Measurements were taken 
every 5 min. 

Plasmid-curing assay. RN4220 cells harbouring both the pGG3 CRISPR-Cas 
plasmid and the pWJ153 target plasmid were cultured in TSB supplemented with 
chloramphenicol (10 1g ml~ ") toa Doo nm Of 0.45. After splitting the culture in two, 
transcription across the target was induced for one of the cultures via the addition of 
ATc toa final concentration of 0.25 ig ml” *. Aliquots of cells were harvested before 
(0) and after (1, 2, 3, 4,5 and 6h) the time of induction. After purification of DNA, 
plasmids were linearized with the common single cutter BamHI and subjected to 
agarose gel electrophoresis. In parallel, serial dilutions of both cultures were pre- 
pared in triplicate for each time point and plated on TSA plates supplemented with 
chloramphenicol and erythromycin or chloramphenicol alone, for quantification 
of antibiotic-resistant colony-forming units (technical replicates). 
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Extended Data Figure 1 | Characterization of spacer 32T isolates 
lysogenized with ®NM1-Erm®. a, ®NM2 sensitivity assay. Eight randomly 
selected ®NM1-Erm* lysogen clones were re-streaked through the indicated 
@NM2-seeded region from top to bottom (1-8); C, sensitive @®NM1-Erm® 
lysogen harbouring the pGG3 control plasmid. b, PCR amplification of the 
CRISPR array (upper panel) and spacer 32T target region (lower panel) for 
the strains tested in a. The pGG3 control lysogen (C) lacks a phage-targeting 
spacer in its CRISPR array. Size markers of 1 kb and 0.5kb are indicated. All 
eight PCR products for the target region were sequenced by the Sanger method 
and no mutations were found (data not shown). c, Plaque-forming potential of 
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filtered supernatants from spacer 32T lysogen overnight cultures inoculated in 
triplicate. Plaque-forming units were enumerated on soft agar lawns of RN4220 
harbouring either the pGG3 control (C) or spacer 32T CRISPR plasmids. 
Dotted line represents the limit of detection for this assay. d, DNM2 plaquing 
efficiency on soft agar lawns of an additional six randomly selected DNM1- 
Erm® lysogen clones isolated during infection of RN4220/spacer 32T (9-14); 
a ®NM1-Erm* lysogen harbouring the pGG3 control plasmid was also tested 
(-C/L). Plaquing efficiency on the non-lysogenic indicator strain harbouring 
pGG3 is shown for comparison (-C). Error bars, mean + s.d. (n = 3). 

a, b, Single experiments performed for 8 of 32 isolates. 
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Extended Data Figure 2 | Characterization of spacer 32T isolates 
lysogenized with ®NM4. a, Visualization of TB4-derived strains grown on 
egg-yolk agar. Integration of ®NM4 within the geh locus of TB4 results in 
strongly reduced lipase secretion, enabling a screen for ONM4 lysogenization 
with spacer 32T. Right-most lanes display two lipase-negative isolates from the 
lysogenization screen; picture is representative of five technical replicates for 
each isolate. b, DNM2 sensitivity assay. Strains shown in a were re-streaked 


through the indicated ®NM2-seeded region from top to bottom. The pGG3 
lysogen and spacer 32T non-lysogen in the two left-most lanes serve as sensitive 
and insensitive controls, respectively. Picture is representative of three technical 
replicates for each isolate. c, ®NM2 plaquing efficiency on soft agar lawns 

of the strains analysed in a and b. 32T(L1) and 32T(L2) refer to the two DNM4 
lysogens isolated during the spacer 32T egg-yolk screen. Error bars, mean + s.d. 
(n = 3). 
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Extended Data Figure 3 | Visualization of ®NM1 transcription profiles 6, 
15, 30 and 45 min after infection (MOI 20). Rightward and leftward 
expression values are plotted as blue and fuchsia lines, respectively, in reads per 
million (RPM). Position of relevant spacer targets are indicated with vertical 
solid lines. The dotted line with arrowheads marks the position of the central 


promoter. To improve readability, all curves were smoothened by plotting the 
average reads per million values over a 500 bp sliding-window. To the left of the 
central promoter, rightward expression is comparable to leftward expression 
by 30 min after infection, consistent with the strand-independent targeting 
observed for this region. 
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Extended Data Figure 4 | Type II CRISPR-Cas targeting in S. aureus 
prevents both lytic and lysogenic infection. a, Plaquing efficiency of PDNM1 
and ®NM1y6 on lawns of RN4220 harbouring type I-A CRISPR-Cas plasmids 
as indicated. The parental vector, pDB184, serves as a non-targeting control. 
b, ONM1-Erm*® lysogenization of RN4220 harbouring either the spacer 
43B-tIl, 4B-tlI, or non-targeting type II-A CRISPR plasmids. c, DNM2 
sensitivity assay for seven randomly selected DNM1-Erm*® lysogen clones 
isolated during infection of RN4220/spacer 43B-tII (1-7). For comparison, a 
resistant non-lysogen harbouring the spacer 43B-tII plasmid and a sensitive 
lysogen harbouring the pDB184 plasmid were included as controls 
(respectively, C+ and C—). Picture represents a single experiment for 7 of 22 
isolates. d, DNM2 plaquing efficiency on soft agar lawns for an additional six 
randomly selected ©NM1-Erm* lysogen clones isolated during infection of 
RN4220/spacer 43B-tII (8-13); a DNM1-Erm* lysogen harbouring the 
pDB184 plasmid is also tested (-C/L). For comparison, plaquing efficiency of 
@NM2 on the non-lysogenic indicator strain harbouring pDB184 or the 
targeting spacer 43B-tII plasmid are also shown (—C and +C, respectively). 
e, Agarose gel electrophoresis of plasmid DNA purified from isolates 8-13 and 
the parental spacer 43B-tII strain (C). The symbols + or — indicate the 
presence or absence of treatment with the BamHI restriction enzyme, which 
produces two bands for the wild-type spacer 43B-tII plasmid: 5367 bp and 
3972 bp. Size markers correspond to 10 kb, 3 kb and 0.5 kb bands of the 1 kb 
DNA ladder from NEB. f, Colony PCR spanning the type II CRISPR array for 
isolates 8-13. Spacer 43B-tII plasmid DNA was used as a template for the 


control (C). Size markers of 3 kb and 0.5 kb are indicated. g, Colony PCR 
spanning the target region for isolates 8-13 and a DNMI1-Erm* lysogen 
harbouring the pDB184 control plasmid (C). Isolates 10 and 11 harbour 
identical deletions within the prophage that remove the target region (see 
below). Size markers of 3 kb and 0.5 kb are indicated. The presence of attL and 
attR prophage integration arms was also verified independently for each isolate 
using PCR (data not shown). h, Location of the 16,985 bp deletion identified 
within the prophage harboured by isolates 10 and 11 (shaded grey box). The 
location and orientation of the ermC insertion cassette is also shown (blue 
arrow). Deletion was mapped by primer walking. An ~9.1 kb product spanning 
the deletion was ultimately amplified using primers oGG6 and 0GG241, and 
the deletion junction was sequenced by the Sanger method using oGG245. 

A perfect 14 bp direct repeat micro-homology flanks the deletion. i, Plaque- 
forming potential of overnight culture supernatants from isolates 8, 10 and 11. 
Supernatants were plated by the soft agar method with RN4220 cells 
harbouring the non-targeting pDB184 control plasmid as an indicator strain. 
Supernatants were also plated with spacer 43B-tII targeting lawns, yielding 
no detectable plaque-forming units. Isolate 8 appears to exhibit wild-type levels 
of spontaneous prophage induction (compare with pGG3 control in Fig. 4a). 
No plaque-forming units were detected from the supernatants of isolates 10 and 
11 whatsoever, presumably resulting from their deletion of genes essential 
for prophage induction, including the ORF 43 major capsid protein. Dotted line 
represents the limit of detection for this assay. Error bars, mean + s.d. (n = 3). 
e-g, Single experiments for 6 of 22 isolates. 
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Extended Data Figure 5 | Visualization of transcription profiles for 
@®NM176 and the ®NM1 prophage. Graphical presentation is the same as 
in Extended Data Figure 3. a, ®NM1Y6 transcription profiles 6 and 15 min after 
infection (MOI 20). Comparison with ®NM1 samples at equivalent time points 
(Extended Data Fig. 3) reveals a marked decrease in leftward transcription 

to the left of the central promoter region. We calculated the fold-change in 
reads per million between DNM1 and DNM1y6 samples 15 min after infection. 
Leftward expression within the region bounded by the start of the genome and 
the central promoter was reduced 32-fold, while only a fourfold reduction in 
leftward expression was observed overall. Meanwhile, rightward expression 


was reduced fourfold both overall and in this region. This suggests an 
approximately eightfold net reduction in leftward transcription originating 
from the central promoter. b, ®NM1 prophage transcription profiles. Strong 
leftward transcription originates from the central promoter and a few upstream 
regions, which are presumed to be important for lysogenic maintenance. 
Rightward transcription was weaker than leftward transcription as expected, 
but not absent. Given the strength of rightward transcription observed 
during the lytic cycle (Extended Data Fig. 3), however, this transcription 
may originate from a subpopulation of cells undergoing prophage induction, 
rather than the stable lysogen majority. 
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Extended Data Figure 6 | Detection of transcription across target insertions | PCR was performed on cDNA products, or plasmid DNA templates for control 


for the pNes(wt-d) and pNes(wt-i) plasmids. For each target plasmid”, (C) lanes. The symbols + or — indicate the presence or absence of reverse 
reverse transcription was performed in both directions with DNase-treated transcriptase enzyme in the reverse transcription reaction mixture used for 
total RNA from RN4220 cells harbouring the indicated plasmids, using either | PCR. Size markers of 500 bp and 100 bp are indicated. Picture is representative 
forward or reverse primers for cDNA synthesis in two separate reactions. of a single technical replicate. 
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Extended Data Figure 7 | Reverse CRISPR-immunity assays using inverted 
chromosomal target insertions or type II CRISPR-Cas plasmids. Values 
represent the average transformation efficiency of three transformations in 
colony-forming units per microgram of plasmid DNA transformed. ATc, 
anhydrotetracycline at 0.5 fig ml _*. Dotted lines represent the limit of detection 
for these assays. a, Reverse CRISPR-immunity assays using inverted target 
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vector insertions and spacer 43T or 43B plasmid DNA. Inversion of the attP 
motif (‘Inv-attP-’) for forward and reverse insertion vectors causes integration 
in the opposite orientation relative to the chromosomal origin of replication. 
b, Reverse CRISPR-immunity assays using type II-A CRISPR plasmid DNA 
to transform strains from Fig. 3b. The pDB184 parent vector serves as a 
non-targeting control. Error bars, mean + s.d. (n = 3). 
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Extended Data Figure 8 | Infection with ®NM1 in liquid culture. Growth 
curves of RN4220 cells harbouring the indicated CRISPR plasmids were 
infected at time zero with ®ONM1 at an MOI of 10 (a) or 100 (b). Growth of 
uninfected RN4220/pGG3 cultures is also shown (dotted red lines). 
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Extended Data Figure 9 | Immunity to ©NM1 6 in liquid culture is 
unaffected by the presence of a tolerated chromosomal target. Growth 
curves of the indicated chromosomal insertion strains from Fig. 3 harbouring 
either spacer 43T or pGG3 CRISPR plasmids, in the absence (dotted lines) or 
presence (solid lines) of DNM1y6 addition at an MOI of 10. Black arrow 
denotes the time of phage addition; no ATc induction is used in this assay. 
The presence of a chromosomal target for spacer 43T has no discernable effect 
on culture growth during spacer 43T-mediated immunity to ODNM1y6 
(compare solid green and blue lines). 
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Extended Data Figure 10 | Inducible curing of a target plasmid. a, Diagram 
of plasmids used in the plasmid-curing experiment. The pGG3 CRISPR 
plasmid harbours a single spacer (‘spcl’) targeting a sequence (‘prtspcl’) 
inserted downstream of the P,y1e¢*-inducible promoter in pWJ153. b, Agarose 
gel electrophoresis of linearized plasmid DNA purified both from 
anhydrotetracycline-treated (+ ATc) and untreated (— ATc) cultures at 

the indicated time points. Size markers of 10 kb, 5 kb and 4kb are indicated. 


Picture is representative of a single technical replicate. c, Colony-forming units 
recovered from cultures analysed in b at each time point. Cells were plated 
with selection for either Cm™ colony-forming units (green) or Cm*, Erm* 
colony-forming units (blue). Targeting of the pWJ153 plasmid via induction 
with ATc (filled circles) is accompanied by a severe drop in erythromycin- 
resistant colony-forming units relative to untreated cultures (open circles). 
Error bars, mean + s.d. (n = 3). 
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Rapid fucosylation of intestinal epithelium sustains 
host-commensal symbiosis in sickness 


Joseph M. Pickard!, Corinne F. Maurice, Melissa A. Kinnebrew’, Michael C. Abt?, Dominik Schenten*, Tatyana V. Golovkina’, 
Said R. Bogatyrev®, Rustem F. Ismagilov®, Eric G. Pamer®, Peter J. Turnbaugh? & Alexander V. Chervonsky' 


Systemic infection induces conserved physiological responses that 
include both resistance and ‘tolerance of infection’ mechanisms’. Tem- 
porary anorexia associated with an infection is often beneficial’, real- 
locating energy from food foraging towards resistance to infection* 
or depriving pathogens of nutrients’. However, it imposes a stress on 
intestinal commensals, as they also experience reduced substrate avail- 
ability; this affects host fitness owing to the loss of caloric intake and 
colonization resistance (protection from additional infections)*. We 
hypothesized that the host might utilize internal resources to support 
the gut microbiota during the acute phase of the disease. Here we show 
that systemic exposure to Toll-like receptor (TLR) ligands causes rapid 
a(1,2)-fucosylation of small intestine epithelial cells (IECs) in mice, 
which requires the sensing of TLR agonists, as well as the production 
of interleukin (IL)-23 by dendritic cells, activation of innate lymphoid 
cells and expression of fucosyltransferase 2 (Fut2) by IL-22-stimulated 
IECs. Fucosylated proteins are shed into the lumen and fucose is lib- 
erated and metabolized by the gut microbiota, as shown by reporter 
bacteria and community-wide analysis of microbial gene expression. 
Fucose affects the expression of microbial metabolic pathways and 
reduces the expression of bacterial virulence genes. It also improves 
host tolerance of the mild pathogen Citrobacter rodentium. Thus, 
rapid IEC fucosylation appears to be a protective mechanism that 
utilizes the host’s resources to maintain host-microbial interactions 
during pathogen-induced stress. 

To maintain healthy gut microbiota during a systemic response in- 
duced by microbial products the host may use its internal resources. 
L-Fucosylation could present an example of such a resource: L-fucose 
attached to glycoproteins and glycolipids is accessible for microbial, but 
not for host energy harvest’*. Constitutive o(1,2)fucosylation affects the 
microbial community in a diet-dependent manner”, serves as a substrate 
for pathogens during antibiotic exposure’° and for microbes colonizing 
the ileum of newborns or of adult germ-free (GF) mice'’’. Under normal 
conditions, however, the small intestine of specific pathogen-free (SPF) 
BALB/c mice is largely free of surface fucose. By contrast, a systemic injec- 
tion of agonists of TLRs, such as lipopolysaccharide (LPS; TLR4 ligand) 
(Fig. 1), CpG DNA (TLR9 ligand), or Pam3CSK, (TLR2 agonist), led to 
ubiquitous o(1,2)fucosylation of the small intestine in mice of different 
genetic backgrounds, which started within a few hours after LPS expo- 
sure and lasted for several days (Extended Data Fig. la-c). It did not 
result in differentiation of IECs into functional M cells'*, which are 
permanently fucosylated and are involved in microbial sensing and 
translocation (Extended Data Fig. 1d). Induced fucosylation was inde- 
pendent of the gut microbiota (observed in GF mice), and was not 
induced by oral LPS (Extended Data Fig. 1e). 

Global deletion of the TLR signalling adaptor molecule MyD88 pre- 
vented IEC fucosylation, and its conditional deletion from dendritic cells, 
but not from IECs, abrogated the process (Fig. 1). The inducible fuco- 
sylation pathway was similar to induction of antimicrobial peptides by 
a systemic microbial signal’*: it required MyD88-expressing dendritic 


cells, production of IL-23, the transcriptional regulator RORyt and IL-22 
(Fig. 1 and Extended Data Fig. 2a), and was induced by a direct injection 
of IL-22 into Myd88'~ mice (Fig. 1). IEC fucosylation in mice lacking 
T cells (Fig. 1) suggested that innate lymphoid cells (ILCs) were a suffi- 
cient source of IL-22. Salmonella enterica ssp. Typhimurium, known to 
spread systemically, induced small intestine IEC fucosylation (Extended 
Data Fig. 2b). The «(1,2)fucosyltransferase responsible for fucosylation 
of IECs in the small intestine was identified as Fut2 (Fig. 2a), which is 
inducible by stress conditions'*” and constitutively expressed in the stom- 
ach and large intestine’*. Genetic ablation of the Fut2 gene blocked IEC 
fucosylation in response to LPS (Fig. 2b, c). The overall chain of events 
is shown in Extended Data Fig. 3. 

LPS injection caused marked sickness in BALB/c Fut2-deficient 
(Fut2~'~) and Fut2-sufficient (Fut2*'* or Fut2*'~) littermates hours 
after injection: mice displayed measurable anorexia, adipsia, reduced activ- 
ity, diarrhoea and weight loss. Food consumption, weight loss, and pro- 
duction of inflammatory cytokines and antimicrobial peptides (Fig. 2d, e 
and Extended Data Fig. 4a, b) were similar in both groups. Although 
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Figure 1 | MyD88-dependent fucosylation of small intestine IECs by 
systemic stimulation of TLRs. UEA-1, which binds «(1,2)-fucosylated 
substrates, staining in the proximal one-third of the small intestine of mice 
untreated or 24h after intraperitoneal (i.p.) LPS injection, or 6 h after injection 
of IL-22 (Myd88"'~ mouse). Scale bars = 100 tm. Staining of tissue from 
mutant mice was always accompanied by staining of wild-type (WT) controls, 
and is representative of at least two independent experiments for each 


genotype. 
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Figure 2 | Consequences of the loss of Fut-2- 
dependent fucosylation. a, Expression of mouse 
o.(1,2)fucosyltransferase genes (Fut2, Fut1 and 
Sec1) and control (villin-encoding gene, Vill) in 
the gut 24h after LPS injection (semiquantitative 
polymerase chain reaction with reverse 
transcription (RT-PCR)). Caec, caecum; Col., 
colon; SI, small intestine. b, Intestinal fucosylation 
(green) of Fut2-sufficient and Fut2-deficient mice. 
Red, propidium iodide. Scale bars = 100 um. 
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Fut2-'~ mice have been shown to be healthy under normal SPF condi- 
tions’’, they were significantly slower than controls in recovering their 
body weight (Fig. 2e) after LPS injection. Importantly, food deprivation 
without LPS challenge did not cause small intestine fucosylation and 
the weight recovery was identical in Fut2-sufficient and Fut2-deficient 
animals (Extended Data Figs 1a and 4c). Thus, fucosylation of the small 
intestine was a response to activation by microbial stimuli and not to 
anorexia per se. 

To understand the reason for the slow weight recovery in Fut2-/~ 
mice, we tested whether fucosylation affected the function of host IEC 
proteins, which were identified by direct sequencing as secreted mucins 
and digestive enzymes (Fig. 3a). Fucosylation did not change the activ- 
ity of several enzymes (Fig. 3b). Thus, changes in enzymatic activity are 
unlikely to explain the slow weight recovery in Fut2'~ mice, although 
the role of less abundant fucosylated proteins cannot be excluded. 

The beneficial effect of fucosylation was dependent on the microbiota: 
the weight recovery after LPS challenge was delayed in GF mice and in 
wild-type SPF mice treated with antibiotics (Fig. 2e). Antibiotics did 
not have a direct effect on the host’s responses to LPS (Extended Data 
Fig. 4a, b, d-g) and did not further impair the recovery of Fut2-/~ mice 
(Fig. 2e). Thus, Fut2 and an intact gut microbiota were both necessary 
for efficient recovery of body weight after LPS challenge. 

To serve as a substrate to the bacteria residing in the large intestine”®, 
fucosylated proteins (Extended Data Fig. 5a—-d) must be released into 
the small intestine lumen. After LPS challenge, Fut2-dependent fucosy- 
lation of luminal proteins was detectable at much higher levels in GF 
mice and in SPF mice treated with antibiotics or with a fucosidase inhib- 
itor, than in SPF controls (Fig. 3c and Extended Data Fig. 5e). Thus, fucose 
is available, released and used by microbes in the large intestine. 

Next, to directly show fucose sensing by intestinal commensal bacteria, 
we used a reporter system in which Escherichia coli expressed green fluo- 
rescent protein (GFP) driven by the promoter of the E. coli fucose meta- 
bolism operon*'” (Fig. 3d, e and Extended Data Fig. 6a). Because E. coli 
lacks «(1,2)fucosidase that cleaves fucose off substrates, in GF mice mono- 
colonized with the reporter E. coli it did not upregulate GFP, even after 
LPS injection (Fig. 3d). Thus, free fucose was not readily available for re- 
porter bacteria in the gut and required bacterial fucosidase activity, which 
was sensitive to antibiotics (Extended Data Fig. 6b). A commensal bac- 
terium with «(1,2)fucosidase activity, Bacteroides acidifaciens, was isolated 


from our mouse colony (Extended Data Fig. 6c-e). In LPS-injected GF 
mice co-colonized with the reporter E. coli and B. acidifaciens, the reporter 
strain expressed significantly more GFP (Fig. 3d), as well as genes for 
fucose import (fucP) and metabolism (fucA) (Fig. 3f). In LPS-treated SPF 
mice, Fut2 was required for GFP reporter expression (Fig. 3e and Ex- 
tended Data Fig. 6f). These findings made it clear that fucose can serve 
as a substrate for the microbiota under conditions of stress applied to 
the host, and underscored the interdependence between members of the 
gut microbial community”. 

To confirm these findings in mice with complex gut microbiota, we 
profiled microbial community structure, gene abundance and transcrip- 
tional activity before and after LPS treatment of Fut2‘’~ and Fut2-/~ 
mice (Supplementary Table 1). Analyses of community structure based 
on 16S and shotgun DNA sequencing revealed that the gut microbiota 
was largely robust to host genotype and LPS exposure (Fig. 3g and Ex- 
tended Data Fig. 7a, b). We did not detect: (1) significant clustering of 
microbial communities based on genotype or LPS treatment (P > 0.05 
for both comparisons; permutational multivariate analysis of variance 
(PERMANOVA) of unweighted UniFrac distances); (2) species-level 
operational taxonomic units (OTUs) significantly associated with host 
genotype before or after LPS treatment (all were q > 0.05; analysis of 
variance (ANOVA)); or (3) significant differences in overall microbial 
diversity (Extended Data Fig. 7c). However, we were able to detect a sig- 
nificantly increased abundance of B. acidifaciens after LPS treatment 
in Fut2-sufficient mice (P < 0.05, linear discriminant analysis (LDA) 
score > 4; linear discriminant analysis effect size (LefSe) analysis of 16S 
profiles), consistent with its ability to utilize fucosylated glycans. 

At the same time, LPS markedly altered community-wide gene express- 
ion in both Fut2*’~ and Fut2~/~ mice (Fig. 3h), with multiple ortholo- 
gous groups differentially expressed upon LPS treatment: 61 in Fut2*/~ 
mice and 56 in Fut2-‘~ animals. These changes were not due to altered 
community structure: only one differentially expressed orthologous group 
(K05351, xylulose reductase) also exhibited significant changes in gene 
abundance (Supplementary Table 2). As expected, we detected a signi- 
ficant upregulation of fucose permease (fucP; K02429) in Fut2-sufficient 
mice after exposure to LPS (Fig. 3i), and increased expression of meta- 
bolic modules for anaerobic respiration, protein and ATP synthesis, iso- 
prenoid biosynthesis and amino sugar import, in addition to pathways 
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Figure 3 | Commensals utilize fucose detached from proteins fucosylated 
by Fut2 upon systemic challenge with LPS. a, Silver-stained SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) of UEA-1-precipitated small 
intestine IEC protein from control or LPS-treated mouse. b, Ratios of digestive 
enzyme activities in small intestine IECs of LPS-treated to untreated Fut2- 
sufficient (black bars) or Fut2-deficient (open bars) mice 2 days after LPS 
injection. Mean ~ s.e.m. of four combined experiments; four mice per group. 
NS, not significant (P > 0.05, two-tailed Student’s t-test). c, SDS-PAGE of 
intestinal contents blotted on nitrocellulose and stained with UEA-1- 
peroxidase complexes. abx, antibiotic-treated mice; c, caecum; DFJ, 
deoxyfuconojirimycin; f, faeces; s, small intestine. Data are representative of at 
least two experiments. d, e, Fucose-sensitive GFP reporter expression in 
gnotobiotic mice colonized with the indicated strains (d) or SPF mice (e). Dots 
are values for individual bacteria, lines are means; n = 120. *P < 0.05, one-way 
ANOVA with Bonferroni post-test (d), two-tailed Student’s t-test (e). Data are 


for aminoglycan degradation (Extended Data Fig. 7d). Thus, intact host 
fucosylation appears to affect gut microbial metabolism. 

Importantly, LPS challenge led to the significantly increased express- 
ion of microbial virulence genes in Fut2-negative but not Fut2-sufficient 
mice, including rtxA (K10953) and haemolysin III (K11068) (Supplemen- 
tary Table 2). Kyoto Encyclopedia of Genes and Genomes (KEGG) path- 
ways potentially involved in microbial pathogenesis (defined as flagellar 
synthesis, chemotaxis, plant/pathogen interaction and Vibrio cholera in- 
fection) were overrepresented in Fut2-deficient mice (Extended Data 
Fig. 7d). We hypothesized that fucosylation induced by systemic microbial 
challenge might limit the effects of additional exogenous or endogenous 
pathogens. We tested this by infecting Fut2-sufficient and Fut2-deficient 
mice with a non-lethal intestinal pathogen, C. rodentium. Four days after 
infection, mice were treated with LPS. Infected Fut2-negative mice lost 
significantly more weight than Fut2-sufficient animals compared with 
respective LPS-treated non-infected controls (Fig. 4a). Thus, infection 
with a non-lethal pathogen further reduced the overall fitness of Fut2- 
deficient mice in response to LPS. C. rodentium did not induce small 
intestine IEC fucosylation and did not colonize the small intestine (Ex- 
tended Data Fig. 8), indicating that systemic challenge by a microbial 
product was required to reveal the role of inducible fucosylation. 

The fitness of infected animals can be maintained through either de- 
creased pathogen burden (resistance), or by an increase in pathogen tol- 
erance without a change in pathogen burden. We quantified the abundance 
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representative of three independent experiments. f, fucA and fucP gene 
expression relative to housekeeping gene rpoA (quantitative RT-PCR) in E. coli 
tested as in d. *P < 0.05 by ANOVA with Bonferroni’s post-test. Data are 
combined from three experiments. g, Stable relative abundance of bacterial 
phyla across treatment groups and genotypes, as indicated by shotgun 
sequencing of community DNA. Phyla with a mean reads per kilobase per 
million mapped reads (RPKM) >40,000 are shown, including Actinobacteria 
(purple), Bacteroidetes (red), Firmicutes (blue), Fusobacteria (green), 
Proteobacteria (orange) and Tenericutes (yellow). 16S rRNA gene sequencing 
confirms these observations. Extended Data Fig. 7 shows 16S rRNA gene 
sequencing results. h, Differentially expressed KEGG orthologous groups 
following LPS treatment (paired glm edgeR analysis; q < 0.05, >2-fold change; 
see Supplementary Table 2 for complete list). i, Increased gut microbial 
expression of fucP (K02429) in Fut2-sufficient mice (mean + s.e.m.; *P < 0.01, 
Mann-Whitney test). 


of C. rodentium in the faeces of infected mice, as well as the adherence 
to IECs of C. rodentium-expressing luciferase**”* (Fig. 4b, c). No differ- 
ences in pathogen loads were found between Fut2-sufficient and Fut2- 
deficient animals treated with LPS. Thus, fucosylation of the small intestine 
upon systemic treatment with LPS probably enhances tolerance of the 
pathogen. Fut2-negative mice infected with C. rodentium and injected 
with LPS had more pronounced colonic hyperplasia (a trademark of this 
infection) compared with Fut2-sufficient mice or mice that did not 
receive LPS (Fig. 4d, e). 

Thus, inducible IEC fucosylation might be viewed as an emergency 
measure taken by the host to support its gut commensals. Fucose used 
by microbes as an energy source may contribute to protection of the host 
from endogenous opportunistic pathogens, or it could increase tolerance 
of infection by regulating bacterial genes responsible for quorum sens- 
ing” or virulence”. Fucose can also serve as a substrate for microbial pro- 
duction of the short-chain fatty acid propionate (Extended Data Fig. 9), 
which is primarily produced by members of the Bacteroidetes phylum”. 
Whether this process contributes to overall fitness of the animals under 
infection-induced stress remains to be elucidated. Of note, around 20% 
of humans lack a functional FUT2 gene, which is linked to Crohn’s dis- 
ease” and to lethality from sepsis in premature infants*’. Overall, fucosy- 
lation of the small intestine in response to systemic microbial exposure can 
be considered a type of ‘tolerance of infection’ response. It is interesting, 
however, that a very similar pathway regulates secretion of antimicrobial 
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Figure 4 | Host fucosylation increases tolerance of a pathogen. a, Difference 
in percentage weight loss between LPS-injected C. rodentium-infected and 
uninfected mice. Data are shown as mean + s.e.m.; *P = 0.01 ***P = 0.0001, 
two-tailed Student’s ¢ test; combined from six experiments. NS, not significant. 
b, Faecal colony forming units (c.f.u.) of C. rodentium from Fut2~ or Fut2~ 
mice (mean + s.e.m., representative of three experiments). c, Luminescence of 
thoroughly washed mid-colon of mice infected with pler-lux" C. rodentium. 
n= A4 for day (d)12 and 8 for d7; mean + s.e.m. RLU, relative luminescence 
units; p.i., post-infection. Data combined from three experiments. d, Average 
crypt depth in uninfected (n = 3) or infected mice (n = 4) at day 12 post- 
infection, with or without LPS injection on day 4 post-infection. Data are 
shown as mean + s.e.m. (two-tailed Student’s t-test). In a-d black bars and 
circles indicate Fut2-positive mice; open bars and circles indicate Fut2-negative 
mice. e, Representative haematoxylin and eosin staining of distal colon of 
LPS-treated mice at day 12 after infection. Scale bars = 100 Lm. Bottom row 
shows magnified boxed regions. 


proteins—a resistance mechanism”. Thus, the mechanisms of resistance 
and tolerance to pathogens could be evolutionarily linked together to 
increase the fitness of the host. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Mice. BALB/cJ, C57BL/6J, NOD/LtJ, C3H/HeN, Rag! /~ (B6.129S7-Rag1™M°"/]), 
Fut2~'~ (B6.129X1-Fut2"™8“°/J), Villin-Cre (B6.SJL-Tg(Vil-Cre)997Gum/J), and 
CD11c-Cre (C57BL/6J-Tg(Itgax-Cre,-EGFP)4097Ach/J) mice were purchased from 
The Jackson Laboratory. Fut2-'~ mice were backcrossed greater than seven gen- 
erations to BALB/c. Knockout mice produced litters of mixed genotypes kept co- 
housed to homogenize their gut microbiota. B6 Myd88'~ mice were a gift from 
S. Akira. B6 MyD88 floxed mice were described previously*!. Rore ‘~ (ref. 32), and 
IL23p19 '~ (also known as I123a ‘~)** mice were provided by Y.-X. Fu. 11227/~ 
mice™ were maintained at the Memorial Sloan-Kettering Cancer Center. Mice were 
housed in a specific pathogen-free facility and used in accordance with institutional 
guidelines for animal welfare. Six- to twelve-week-old male and female mice were 
used for randomization purposes. The numbers of mice per group were chosen as 
the minimum needed to obtain biologically significant results, based on previous 
experience. Evaluations were made ina blind fashion. Functional experiments were 
done with Fut2-negative mice on the BALB/c genetic background using Fut2- 
sufficient littermates as controls. 

Metabolic studies. Mice were single-housed in TSE Systems LabMaster cages to 
monitor physical activity (x/y and z axes by infrared beams crossed), drinking and 
feeding. 

Fucosylation activation in vivo. LPS from S. enterica ssp. Typhimurium (#L6511, 
Sigma-Aldrich) was injected ip. at 1 ugg ' body weight, or gavaged to GF mice at 
1 mg in 400 jl sterile PBS. CpG (100 jg CpG-B ODN 1826, Coley Pharmaceutical 
Group) and Pam;CSK, (100 ug, Invivogen), were injected i.p. Carrier-free recom- 
binant mouse IL-22 (Biolegend,) was diluted in 1% BSA/PBS and 1.5 jg was given 
ip. 

Antibiotic treatment. Ampicillin (1 g1~') and 200 mg1~! vancomycin (Sigma- 
Aldrich) were 0.22 jm filtered and added to autoclaved drinking water starting 
2 days before LPS treatment (day —2) and lasting for the duration of the experiment. 
Lectin staining. For whole-mount staining, small intestine was removed, a 1 cm 
piece from the upper third was excised, opened, cleaned of mucus in cold PBS, and 
incubated with Ulex europaeus aggluitinin-1 (UEA-1) conjugated to FITC, TRITC 
or atto-594 (Vector Laboratories or Sigma-Aldrich) for 15 min on ice. Tissue was 
placed lumen side up on a slide for microscopy. For [122~‘~ experiments, whole 
tissue was fixed in 4% paraformaldehyde before proceeding with whole-mount 
staining. Fixation did not affect staining pattern or intensity. 

For staining of sections, tissues were fixed in 2% paraformaldehyde overnight at 
4 °C, cryoprotected in 20% sucrose/PBS overnight at 4 °C, and embedded in OCT 
compound (Sakura Finetek), and 10 jim sections were stained with UEA-1-FITC 
(1 pg ml | Vector) for 30 min at room temperature, and incubated for 20 min at 
37 °Cin propidium iodide (0.5 jig ml’, Sigma-Aldrich) with RNase A (10 1g ml’, 
Sigma-Aldrich) to label nuclei. 

For single-cell analysis by FACS, small intestine (with Peyer’s patches and mes- 
enteric fat removed) was divided into equal thirds, opened longitudinally and washed 
with cold PBS. Tissue was cut into 1 cm pieces and shaken in 10 mM EDTA/1 mM 
dithiothreitol (DTT)/PBS at 37 °C for 20 min, and filtered through nylon mesh. 
Single-cell suspensions were pelleted and fixed in 5 ml of 1% paraformaldehyde 
overnight at room temperature. Cells were then stained with UEA-1-FITC (1 pg ml’, 
Vector), and gated on the FSC/SSC high epithelial cell population. 

Light microscopy. Fluorescence microscopy of whole-mounts and sections used a 
Leica DM LB microscope (Leica Camera AG) and Spot RT Slider camera and soft- 
ware (Diagnostic Instruments). Confocal microscopy of bacteria was performed 
with an Olympus DSU spinning disk microscope and Slidebook software (31). All 
images in an experiment were taken using the same exposure settings. 

Scanning electron microscopy. Tissues were fixed in 2% paraformaldehyde/2% 
glutaraldehyde/0.1 M cacodylate buffer, transferred to cacodylate buffer overnight 
and processed with the OTOTO procedure*’. To dehydrate, samples were passed 
through increasing concentrations of acetone in water, followed by hexamethyl- 
disilazane (Electron Microscopy Sciences). Samples were mounted with colloidal 
silver paste (Electron Microscopy Sciences) and imaged with an FEI Nova NanoSEM 
230 (FEI) at 5kV. 

Identification of fucosylated proteins. To visualize fucosylated proteins in the 
lumen contents, the intestinal contents were gently removed by squeezing and homo- 
genized at 1 g sample per 5 ml of Tris-Triton-X-100 buffer (150 mM NaCl, 50 mM 
Tris pH 8, 1% Triton X-100, and protease inhibitor tablet (Roche)) on ice and spun 
at 17,000g for 20 min at 4°C. Supernatant proteins were separated by 6% non- 
reducing SDS-PAGE, transferred to PVDF membrane (Bio-Rad), blocked with 
0.5% gelatin in 0.05% Tween-20/PBS, and visualized with UEA-1-HRP (1 pg ml~ = 
Sigma-Aldrich). 

For sequencing of fucosylated epithelial cell proteins, small intestine epithelial 
cells were isolated as described earlier and lysed in Tris-Triton-X-100 buffer. Lysates 
were pre-cleared by incubating with unconjugated agarose beads (Vector) twice for 
45 min at 4 °C. Cleared lysates were then incubated with washed UEA-1-conjugated 


beads (Vector) for 45 min. Beads were washed five times to remove unbound protein. 
To elute UEA-1-bound protein, beads were incubated with 200 mM L-fucose (Sigma- 
Aldrich) for 30 min. Eluted proteins were separated on a 44-15% gradient SDS-PAGE 
gel (Bio-Rad), silver-stained (Thermo Fisher Scientific), and bands were excised 
for identification by mass spectrometry at the Taplin Biological Mass Spectrometry 
Facility, Harvard Medical School. 

Isolation of fucosidase-positive bacteria. Brain heart infusion (BHIS) agar plates 
(Becton, Dickinson, and Co.) with added thioglycolic acid, menadione and hematin/ 
histidine (Sigma-Aldrich) were pre-reduced in an anaerobic chamber with 2.5% hy- 
drogen atmosphere at 37 °C, and spread with 40 pl of 5-bromo-4-chloro-3-indolyl- 
a-L-fucopyranoside (50 mM in DMEF) (Carbosynth). Faecal pellets were homogenized 
in reduced 0.01% thioglycolic acid/PBS, plated, and grown for 3 days at 37 °C anae- 
robically, then at 4 °C aerobically to develop colour. Blue colonies were identified 
by sequencing of their 16S rDNA genes using primers 8F (5'-AGAGTTTGATCC 
TGGCTCAG-3’) and 1391R (5'-GACGGGCGGTGWGTRCA-3’), and sequencing 
of their gyrB genes using primers gyrB F (5'-GAAGTCATCATGACCGTTCTGC 
AYGCNGGNGGNAARTTYGA-3’) and gyrB R (5’-AGCAGGGTACGGATGT 
GCGAGCCRTCNACRTCNGCRTCNGTCAT-3’) for amplification and gyrB FS 
(5'-GAAGTCATCATGACCGTTCTGCA-3’) and gyrB RS (5’-AGCAGGGTAC 
GGATGTGCGAGCC-3’) for sequencing’’. B. thetaiotaomicron VPI-5482 and 
B. uniformis (a gift from C. Nagler) were used as controls. 

Measurement and inhibition of fucosidase activity. Faecal pellets were weighed 
and homogenized in 10 pl PBS per 1 mg sample, and centrifuged at 17,000g for 
10 min. Fifty microlitres of this supernatant was incubated for 1 h at 37 °C with or 
without adding 0.5 tl of 50 mM 4-methylumbelliferyl fucopyranoside (in dimethyl- 
sulphoxide (DMSO), Sigma or Gold Biotechnology). The reaction was diluted 
100-fold in 0.2 M glycine-NaOH buffer, pH 10.5, and fluorescence was measured 
at 365 nm excitation, 445 nm emission. Fluorescence of the no-substrate control 
was subtracted from the substrate-containing reaction. The amount of cleaved 
substrate and fucosidase activity was then calculated by comparison to a standard 
curve of 4-methylumbelliferone (Sigma) in glycine-NaOH buffer. Deoxyfucono- 
jirimycin (DFJ; Enzo Life Sciences) was dissolved in PBS and gavaged to mice in 
100 ul (5 pmol total). 

Reporter E. coli. The pXDC94 plasmid’, containing the ptac promoter control- 
ling mCherry expression and a multiple cloning site upstream of the promoter-less 
GFP gene, wasa gift from H. Shuman. The promoter region upstream of the E. coli 
fucPIK genes” was amplified using primers that added restriction sites (bold) at 
the 5’ and 3’ ends of the amplicon, respectively: fucproF, 5'-TATGGTACCGGAT 
TCATTTTCAAATAAAA-3’; fucproR, 5'-TATCCCGGGTAGCTACCTCTCTC 
TGATTC-3’. The PCR product and vector were digested with XmalI and KpnI 
(New England Biolabs), gel purified, ligated, and introduced into E. coli K-12 strain 
BW25113 (Yale Coli Genetic Stock Center) by electroporation. Correct expression 
of mCherry and GFP was verified by growth in minimal medium” with 10 mM 
glucose and the indicated concentrations of fucose. 

E. coli K12 bacteria carrying the pXDC94-fucPIK-pro reporter were grown 

overnight in Luria-Bertani broth (LB) with shaking at 37 °C, centrifuged at 5,000g, 
resuspended to ~10° c.f.u. ml” ' in PBS, and 400 ull gavaged to mice that had received 
LPS i.p. 6h earlier (SPF mice), or at the time of gavage (gnotobiotic mice). For dual 
colonization with B. acidifaciens, 100 1l of stationary-phase culture was gavaged at 
the same time as E. coli. Twenty-four hours after gavage, mice were killed and their 
caecum and colon contents were homogenized in PBS. Bacteria were enriched by 
centrifuging at 200g for 5 min, centrifuging the supernatant at 5,000g for 5 min, and 
resuspending the pellet in 750 kl PBS. This was underlain with 300 jl Histopaque- 
1119 (Sigma-Aldrich) and centrifuged for 1 min at 11,600g. The interface contain- 
ing mostly bacteria was washed with PBS, fixed in 2% paraformaldehyde for 20 min 
at room temperature, washed and resuspended in 50-100 il PBS. Four microlitres 
of bacterial suspension was placed on a slide and coverslipped. Random fields were 
selected using red fluorescence only, and an image was acquired in both the red and 
green channels. Fluorescence of individual bacteria were measured in ImageJ 1.41 
software™ by gating on areas of red fluorescence and measuring the mean green 
pixel intensity within the gated area. 
Estimation of total bacterial loads. Fresh faecal pellets were placed in an anaer- 
obic chamber and mashed in 500 il of reduced PBS containing 0.01% thioglycolic 
acid. Serial dilutions were made in reduced PBS and plated on pre-reduced Brucella 
blood agar plates (Becton, Dickinson, and Co.; for anaerobic counts), removed from 
the anaerobic chamber, and plated on tryptic soy agar (TSA)/5% sheep’s blood plates 
(Becton, Dickinson, and Co.; aerobic counts). Total colonies were counted after 
2 days (aerobic) or 3 days (anaerobic) incubation at 37 °C. 

For 16S copy number, faecal pellets were weighed, DNA was isolated by a bead 
beating and phenol/chloroform extraction method”, and qPCR performed and 
copy number determined as described”. 

S. enterica Typhimurium and C. rodentium infection. For S. enterica Typhimurium 
infection, mice were gavaged with 20 mg streptomycin (Sigma) in 100 Ul sterile 
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water, 24h before infection. S. enterica Typhimurium strain SL1344 was grown 
overnight in LB with streptomycin (50 jg ml _') and gavaged to mice at 5 X 10° cfu. 
in 100 pl volume. 

C. rodentium strains DBS100 or DBS120 pler-lux were grown in LB overnight 
at 37 °C, then diluted 1:100 and grown for 2.5h, centrifuged and resuspended in 
0.01 volumes PBS, and mice were gavaged with ~5 X 10” bacteria in 100 pl LB. To 
determine mouse colonization levels, a fresh faecal pellet or small intestine contents 
(gently squeezed to remove, except last 3 cm of ileum) was weighed, mashed in 500 ul 
of PBS, serially diluted, and plated on MacConkey (Becton, Dickinson, and Co.) 
or LB agar with 50 1g ml’ kanamycin. For luciferase measurements, the faecal 
homogenate was adjusted to 10 mg in 1 ml PBS in an Eppendorf tube and light 
measured in a Triathler scintillation counter (Hidex), before plating dilutions in 
PBS on agar with kanamycin. Colon-adherent bacteria were measured in a 1 cm 
piece from the middle of the colon. The piece was opened longitudinally, washed 
in 1mM DTT/PBS by vortexing for 10s, then washed in PBS, and placed in an 
Eppendorf in 500 11 PBS for light measurement as before. The piece was then mashed 
and dilutions plated on agar with kanamycin. 

Histology. Distal colon from C. rodentium-infected mice or uninfected controls 
was fixed in neutral formalin, then kept in 70% EtOH until being embedded in paraffin, 
and 5 jm sections were cut and stained with haematoxylin and eosin. Well-oriented 
crypts were photographed and their lengths measured in ImageJ, and the mean 
taken for each mouse. 

Serum cytokine ELISA. Serum concentrations were measured by ELISA accord- 
ing to the manufacturer’s instructions (IL-1: eBioscience; IL-6 and TNF-a: Becton, 
Dickinson and Co.). 

Short-chain fatty acid measurements. For gavage experiments, food was removed 
and SPF mice were gavaged at 0, 3 and 6h with 300 ul of 0.1 M L-fucose or 0.1M 
D-galactose (Sigma-Aldrich) in autoclaved tap water, or water only. At 8h, mice 
were killed and caecal contents removed and kept at —80 °C until processing. 

Concentrations of the short-chain fatty acids in caecal contents were measured 
using the direct injection gas chromatography-mass spectrometry (GC-MS) method 
adapted from that described previously*!’. 

Caecal contents were extracted in two steps. Deionized water was added to the 
samples at the amount of 5 ml per 1 g of sample, followed by brief vortex-mixing 
and sonication for 15 min. In the second step, acetonitrile (#1103, BDH) contain- 
ing 20 mM of tetradeuteroacetic acid (#16621, Acros Organics) was added to the 
aqueous extracts at the amount of 5 ml per 1 g of initial sample, followed by the 
second round of brief vortex-mixing and sonication for 15 min. 

Extracted samples were centrifuged for 5 min at 17,000g and 0.02 ml of clear super- 
natant was mixed with 0.98 ml of acetonitrile containing 20 mM of formic acid 
(#94318, Fluka) and 0.05 mM of 2-ethylbutyric acid (#109959, Aldrich). The mix- 
tures were briefly vortex-mixed and centrifuged for 5 min at 17,000g and 1 1] of the 
obtained supernatants were analysed by direct injection GC-MS on an Agilent 6890N 
GC system equipped with a Mass Selective (MS) Detector 5973 (Agilent Technologies). 

The instrument was used in a splitless mode with an installed double taper inlet 
liner (#23308, Sky by Restek) and fused-silica column with polyethylene glycol 
stationary phase (INNOWax #19091N- 133, J&W Scientific, Agilent Technologies) 
30m X 0.25 mm ID coated with 0.25 um film. 

Helium carrier gas was supplied at 1.00 ml min’ flow rate. The injection port 
temperature was 260 °C. The initial oven temperature of 60 °C was maintained for 
2 min, then increased to 150 °C at 10 °C min! and further to 250 °C at 25 °C min! 
and held at that temperature for 4 min, bringing the total duration of the run to 
19 min. The MS detector temperature was set to 245 °C. 

GC-MS data were analysed using the MSD ChemStation D.01.02.16 software 
(Agilent Technologies). Tetradeuteroacetic and 2-ethylbutyric acids served as internal 
standards. 

Enzymatic activity of digestive enzymes. Epithelial cells were dissociated from 
the upper half of the small intestine as described earlier. Haematopoietic cells were 
removed with anti-CD45-biotin (clone 30-F11, Biolegend) and streptavidin mag- 
netic beads with a MACS LS column (Miltenyi Biotec). Epithelial cells were diluted 
in 96-well plates in triplicate in appropriate substrate solutions (for sucrase, 60 mM 
sucrose in PBS; for maltase, 30 mM maltose in PBS; for aminopeptidase, 5 mM leu- 
pNA (Enzo Life Sciences) in 50 mM TRIS, pH 7.4). Cells were incubated for 30- 
60 min at 37 °C. For the aminopeptidase assay, absorbance was measured at 405 nm. 
For the sucrase and maltase assays, plates were centrifuged and 5 1] of supernatant 
was used for a colorimetric assay to measure liberated glucose (Cayman Chemical). 
Metagenomic library preparation and sequencing. Methods for microbial com- 
munity DNA and mRNA sequencing were as previously described***’. Faecal pel- 
lets collected from mice before or after LPS treatment (pooled from 2 and 3 days 
post-injection) were kept at —80 °C until DNA and RNA isolation using the gua- 
nidinium thiocyanate/caesium chloride gradient method” as described, except that 
crude particles were removed by centrifugation before overlaying the gradient. RNA 
was subjected to DNase treatment (Ambion), purification using MEGA Clear columns 
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(Ambion), and rRNA depletion via subtractive hybridization (MICROBExpress, 
Ambion, in addition to custom depletion oligonucleotides). The presence of geno- 
mic DNA contamination was assessed by PCR with universal 16S rDNA primers. 
cDNA was synthesized using SuperScript II and random hexamers (Invitrogen), 
followed by second-strand synthesis with RNaseH and E.coli DNA polymerase (New 
England Biolabs). Samples were prepared for sequencing with an Illumina HiSeq 
instrument after enzymatic fragmentation (NEBE6040L/M0348S). Libraries were 
quantified by qPCR according to the Illumina protocol. qPCR assays were run using 
ABsoluteTM QPCR SYBR Green ROX Mix (Thermo Scientific) on an Mx3000P 
QPCR System instrument (Stratagene). The size distribution of each library was 
quantified on an Agilent HS-DNA chip (Agilent). 

16S rRNA gene sequencing and analysis. Community DNA was PCR-amplified 
using universal bacterial primers targeting variable region 4 of the 16S rRNA gene 
with the following thermocycler protocol: denature at 94°C for 3 min, 35 cycles 
of 94°C for 45 s, 50 °C for 30 s, and 72 °C for 90 s, with a final extension at 72 °C for 
10 min**”"*, Triplicate reactions for each sample were pooled and amplification 
was confirmed by 1.5% gel electrophoresis. 16S rRNA gene amplicons were cleaned 
with the Ampure XP kit (Agencourt) and quantified using the Quant-iT Picogreen 
ds DNA Assay Kit (Invitrogen). Barcoded amplicons from multiple samples were 
pooled and sequenced using the Illumina HiSeq platform”. 16S rRNA gene sequences 
were analysed using the QIIME (Quantitative Insights Into Microbial Ecology)“ 
software package along with custom Perl scripts. Data sets were randomly subsampled 
before clustering analyses at a depth that retained all of the individual samples (180,000 
sequences per sample). All sequences were used for the comparison of the relative 
abundance of bacterial taxonomic groups. OTUs were picked at 97% similarity against 
the Greengenes database (constructed by the nested_gg_workflow.py QiimeUtils 
script on 4 February 2011), which we trimmed to span only the 16S rRNA region 
flanked by our sequencing primers (positions 521-773). The LefSe package was 
used to identify taxonomic groups significantly associated with each treatment”. 
LefSe was run on the sub-sampled data sets, after filtering out species-level OTUs with 
<100 sequences or present in only 1 sample. Statistical analyses were also performed 
using the QIIME scripts ‘otu_category_significance’ (ANOVA) and ‘compare_ 
categories.py (PERMANOVA). 

Reference genome database. A custom database was constructed from draft and 
finished reference genomes obtained from human-associated microbial isolates (538 
genomes from the Human Microbiome Project Data Analysis and Coordination 
Center (http://www.hmpdacc.org)), in addition to the Eggerthella lenta DSM2243 
reference genome. All predicted proteins from the reference genome database were 
annotated with KEGG orthologous groups (KOs) using the KEGG database (ver- 
sion 52; BLASTX e-value < 107°, bit score > 50, and >50% identity)’. For query 
genes with multiple matches, the annotated reference gene with the lowest e-value 
was used. When multiple annotated genes with an identical e-value were encoun- 
tered after a BLAST query, we included all KOs assigned to those genes. Genes from 
the database with significant homology (BLASTN e-value < 10° *°) to non-coding 
transcripts from the 539 microbial genomes were excluded from subsequent analysis. 
Metagenomic sequence analysis. DNA- and RNA-seq analysis was performed with 
our recently described pipeline*. Briefly, high-quality reads (see Supplementary 
Table 1 for sequencing statistics) were mapped using SSAHA2” to our custom 539- 
genome database and the Illumina adaptor sequences (SSAHA2 parameters: ‘-best 
1 -score 20 -solexa’). The number of transcripts assigned to each gene was then 
tallied and normalized to reads per kilobase per million mapped reads (RPKM). To 
account for genes that were not detected owing to limited sequencing depth, a pseu- 
docount of 0.01 was added to all samples. Samples were clustered in Matlab (version 
7.10.0) based on gene expression or abundance. Genes were grouped by KOs by 
calculating the cumulative RPKM for each sample for genes present in at least six 
samples. We used HUMAnN, a recently developed software package for metabolic 
reconstruction from metagenomic data”, followed by LefSe analysis to identify 
metagenomic biomarkers”. A modified version of the “SConstruct” file was used 
to input KO counts into the HUMAnN pipeline for each RNA-seq data set. We then 
ran LefSe on the resulting KEGG module abundance file using the “-o 1000000’ 
flag. We used the edgeR package” to identify orthologous groups with significantly 
altered abundance or expression. Prior to analysis, we calculated the cumulative 
number of sequencing reads assigned to each sample for each KO (without RPKM 
normalization). We then used a paired glm analysis to determine consistent changes 
within each animal following LPS treatment. Default parameters were used, with 
one exception: for the estimateGLMTrendedDisp step ‘min.n’ was set to 50/300 for 
the genomes and KO, respectively. Significance was accepted at a false discovery 
rate (FDR) < 0.05 and >2-fold change. 

PCR and RT-PCR. RNA was isolated from mouse tissues and gut contents by the 
guanidinium thiocyanate/caesium chloride gradient method*’. RNA was DNase 
treated (Sigma) and reverse transcribed with Superscript III (Invitrogen). Primer 
sequences were as follows: futlF, 5’-CAAGGAGCTCAGCTATGTGG-3’, fut1R, 
5'- GACTGCTCAGGACAGGAAGG-3’; fut2F, 5’-ACAGCCAGAAGAGCCAT 
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GGC-3’, fut2R, 5’-TAACACCGGGAGACTGATCC-3’; seclF, 5’-ATCCAAGC 
AGTGCTCCAGC-3’, sec1R, 5’-CAATATTCGCCCATCTGGTTC-3’; villinF, 5’-G 
CTTGCCACAACTTCCTAAG-3’, villinR, 5’-CTTGCTTGAAGTAGCTCCGG-3’. 
Quantitative RT-PCR was performed on an Applied Biosystems StepOnePlus instru- 
ment with Universal Sybr Green Universal Supermix (Bio-Rad), and the following 
primers”: E. coli fucA F, 5’-GGCGCGCAAGGAATAGAA-3’, E. coli fucA R, 5’- 
GATCCCCGCTATTCACTACATGA-3’; E. coli fucP F, 5’-CCAAATACGGTTC 
GTCCTTCA-3’, E. coli fucP R, 5’-ACCCATGACCGGAGTGACAA-3’; E. coli rpoA 
F, 5’-GCGCTCATCTTCTTCCGAAT-3’, E. coli rpoA R, 5'-CGCGGTCGTGGT 
TATGTG-3’. 

Statistics. Statistical analyses were performed with GraphPad Prism 5 software 
(GraphPad Software). 
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Extended Data Figure 1 | Requirements and kinetics for small intestine 
fucosylation induced by systemic injection of TLR ligands. a, Systemic 
injection of bacterial TLR ligands induces small intestine fucosylation, but 
simple starvation does not. UEA-1 staining (as in Fig. 1) after i.p. injection of 
CpG DNA, or Pam3CSK,, or food deprivation for 24h of BALB/c SPF mouse. 
b, LPS injection causes small intestine fucosylation in various inbred mouse 
strains. SPF mice of the indicated strains were injected with LPS i.p. and the 
small intestine was stained with UEA-1 after 24h, as in Fig. 1. c, Fucosylation 
peaks at 8h after LPS injection and is still detectable at 96h. d, M cells can be 
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readily detected by scanning electron microscope and UEA-1 staining of the 
domes of the Peyer’s patches, but are rare in the villi and are not massively 
induced in the villi by LPS injection. UEA-1 staining and scanning electron 
microscopy were performed on adjacent pieces from the proximal one-third of 
the small intestine. Scale bars = 100 um for UEA-1 staining, 50 um for 
scanning electron microscope images. e, Small intestine fucosylation does not 
require the presence of endogenous microbiota (LPS injection in GF mouse) 
and is not induced by oral administration of LPS (1 mg). All data are 
representative of at least two independent experiments. 
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Extended Data Figure 2 | MyD88-dependent pathway for fucosylation of 
small intestine IECs in response to systemic stimulation of TLRs. a, FACS 
analysis of IECs from three segments of small intestine from the indicated mice. 
Cells are gated on the FSC/SSC high epithelial cell population. At least two mice 
per mutant genotype were stained along with two control mice in the 


experiments shown. b, SPF mice were pre-treated with 20 mg streptomycin and 
orally infected with S. enterica Typhimurium. The small intestine was stained at 
24h post-infection. MyD88 expression was necessary in CD11c* cells but not 
villin* IECs for S. enterica Typhimurium-induced fucosylation. Data are 
representative of at least two independent experiments. 
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Extended Data Figure 3 | A proposed model for the mechanisms linking cell surface and later shed into the lumen. Fucosidase-expressing bacteria (blue) 
inducible fucosylation to the gut microbiota. Systemic microbial agonists liberate fucose residues, which they can utilize and share with other bacteria 
activate TLRs on CD1 1c’ dendritic cells (DCs), causing secretion of the lacking the fucose-cleaving enzyme. Bacterial metabolism of fucose potentially 
cytokine IL-23, which in turn stimulates RORyt-dependent ILCs to secrete produces metabolites such as short-chain fatty acids (SCFAs). Fucose also 
IL-22. IL-22 causes small intestine epithelial cells to upregulate Fut2. directly or indirectly downregulates virulence gene expression by pathobionts 
Fucosylated proteins are either secreted into the lumen or expressed on the (red) or bona fide pathogens”. 
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Extended Data Figure 4 | Consequences of LPS injection in Fut2-sufficient 
and Fut2-deficient BALB/c mice. a, Inflammatory cytokines IL-1, IL-6 and 
TNF-« were measured by ELISA in sera of mice before or 2h after injection 
with LPS (4h for IL-1). abx, mice on antibiotic water for 2 days before 
injection. Bars are mean + s.e.m.; ND, not detected. Data are combined from 
three experiments. b, Expression of ReglIIy (also regulated by the MyD88- 
IL23-IL22 pathway). Measurement by qPCR of reg3g gene expression in mid- 
small-intestine tissue, relative to gapdh (ddCt method). Numbers indicate 
mean fold change + s.e.m. in LPS-treated versus untreated mice. Differences 
between LPS-treated Fut2* and abx or Fut2™ levels are not significant 

(P > 0.05, two-tailed Student’s t-test). Data are combined from three 
experiments. c, Weight loss and recovery is not different in Fut2*’~ and 
Fut2~/~ mice after simple starvation (mean = s.e.m., P > 0.05 at all time 
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points, two-tailed Student’s t-test; NS, not significant). d, Lack of direct toxic 
effect of antibiotics (abx) measured as the weight loss of BALB/c GF animals 
treated with LPS i.p (mean + s.e.m., P > 0.05 by two-tailed Student’s t-test at all 
time points). Data are combined from two experiments. e-g, Similar total 
bacterial loads in Fut2*/~ and Fut2~‘~ mice before and after LPS injection and 
antibiotic treatment. Total bacterial loads in faeces were estimated by plating on 
aerobic (e) and anaerobic (f) non-selective media, and by qPCR for 16S gene 
copies (g). There were no significant differences between Fut2-sufficient (filled 
circles) and Fut2-deficient (open circles) mice before or after LPS treatment 
(two-tailed Student’s t-test). Circles indicate individual mice; horizontal lines 
indicate means; red circles indicate antibiotic-treated mice. Data are combined 
from three experiments. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 2 Gene = . 
Peptide source Function Location Abundance 
symbol 
Maltase glucoamylase mgam glycosidase apical membrane 73 
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Extended Data Figure 5 | Fucosylated protein in IECs and gut contents. conjugated to horseradish peroxidase (HRP). ¢, Identical gel stained with 
a, Proteins «(1,2)fucosylated in IECs after LPS injection identified by UEA-1 Coomassie blue for total protein content. d, Relative density of the boxed area of 
precipitation and mass spectrometry. Abundance is the number of peptide each lane from b divided by the relative density in c. e, UEA-1 staining of 


fragments attributed to each gene. b, IECs from Fut2* untreated, Fut2" LPS- luminal proteins as in Fig. 3c. Blot is overexposed to show absence of luminal 
treated, or Fut2~ untreated mice were isolated, and lysates separated by SDS- __fucosylated proteins in the LPS-treated, Fut2 mouse. b-e, Data are 
PAGE. «(1,2)fucosylated proteins were detected by blotting with UEA-1 lectin _ representative of two independent experiments. 
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Extended Data Figure 6 | Generation of fucose-sensing reporter bacterial 
strains. a, Reporter E. coli were grown to stationary phase in minimal medium 
containing 10 mM glucose and the indicated concentrations of L-fucose 
(asterisk indicates promoter-less vector), and GFP fluorescence was measured. 
b, Fucosidase activity is dramatically reduced after 2 days of antibiotics (abx) 
treatment but recovers after cessation of treatment. Measurement of total 
a.-L-fucosidase activity in faeces. Faecal supernatant was assayed for cleavage 
of 4-methylumbelliferyl-fucopyranoside substrate by fluorescence. n = 5 SPF 
antibiotics-treated, 3 GF mice. ¢, Faecal homogenates were plated anaerobically 
on BHIS agar containing 5-bromo-4-chloro-3-indolyl «-L-fucopyranoside, 
which forms a blue precipitate upon cleavage of the fucosyl residue. Both blue 
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and white colonies are present. d, Pure cultures of Bacteroides species were 
streaked on the same medium as in c. B. uniformis (left) is not predicted to carry 
an o-L-fucosidase gene, and remains white; B. acidifaciens (middle) and 

B. thetaiotaomicron (right) both express fucosidase activity and develop blue 
colonies. e, Loss of B. acidifaciens from the faeces of mice treated with 
antibiotics (Abx) in water (PCR for the gyrB gene). C—, water control; C+, 
B. acidifaciens genomic DNA. f, Summary of reporter E. coli experiments in 
SPF mice (representative experiment is shown in Fig. 3e). Points are mean GFP 
fluorescence from all reporter bacteria measured in each of three independent 
experiments (n = 65 bacteria per mouse; *P < 0.05, Student’s ¢ test). 
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Extended Data Figure 7 | Microbial community structure is impacted by point (before or after LPS exposure). Results are based on 180,000 randomly 


cohousing yet robust to host fucosylation and LPS exposure, whereas selected 16S rRNA gene sequences per sample. c, Microbial diversity as 
microbial gene expression depends on Fut2. a, Stable relative abundance of | measured by the Shannon diversity index (n = 178,100 sequences per sample). 
bacterial phyla across treatment groups and genotypes, as indicated by 16S Values are mean + s.e.m. (1 = 3 Fut2*, 4 Fut2™ mice per time point). d, KEGG 
rRNA gene sequencing. Values represent the mean abundance of phyla found modules and pathways expressed in microbiota at higher levels after LPS 

at >1% relative abundance in at least one sample. b, Unweighted UniFrac exposure in Fut2-positive (left) and Fut2-negative mice (right) (n = 3 per 


analysis of the gut microbiota of Fut2-deficient (no outline) and Fut2-sufficient | group; Humann/LefSe analysis; LDA >2). 
(black outline) mice. Points are coloured based on kinship and labelled by time 
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Extended Data Figure 8 | Lack of indicible fucosylation and small intestine 
colonization in C. rodentium-infected mice. a, C. rodentium causes no small 
intestine fucosylation in SPF mice at day (d)3, day 7, or day 12 post-infection 
(p.i.). b, Small intestine colonization by C. rodentium is low regardless of 


d12 pi. 
+LPS 


Fut2 expression and LPS treatment. Small intestine contents were removed 
by gentle squeezing, homogenized in PBS, and plated. Data are shown as 
mean + s.e.m.; n = 4. NS, not significant. Dotted line shows the limit of 
detection. Data are representative of two experiments. 
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Extended Data Figure 9 | Effect of exogenous fucose on caecal short-chain _ gavage leads to increased propionate production in SPF but not GF mice. 
Data are shown as mean = s.e.m. **P < 0.01, Student’s two-tailed t-test. ND, 


fatty acid levels. Caecal short-chain fatty acids were measured after gavaging 
starved mice with the indicated sugars (100 mM concentration). Fucose not detected. Data are combined from three experiments. 


©2014 Macmillan Publishers Limited. All rights reserved 


| sd Wal Be 8 


doi:10.1038/nature13612 


Enhanced neonatal Fc receptor function improves 
protection against primate SHIV infection 


Sung-Youl Ko', Amarendra Pegu', Rebecca S. Rudicell'+, Zhi- yong Yang't, M. Gordon Joyce’, Xuejun Chen!, Keyun Wang’, 
Saran Bao, Thomas D. Kraemer’, Timo Rath?, Ming Zeng", Stephen D. Schmidt’, John-Paul Todd', Scott R. Penzak*}, 
Kevin O. Saunders, Martha C. Nason®, Ashley T. Haase’, Srinivas S. Rao!, Richard S. Blumberg’, John R. Mascola! & Gary J. Nabel'+ 


To protect against human immunodeficiency virus (HIV-1) infec- 
tion, broadly neutralizing antibodies (bnAbs) must be active at the 
portals of viral entry in the gastrointestinal or cervicovaginal tracts. 
The localization and persistence of antibodies at these sites is influ- 
enced by the neonatal Fc receptor (FcRn)'”, whose role in protecting 
against infection in vivo has not been defined. Here, we show that a 
bnAb with enhanced FcRn binding has increased gut mucosal tissue 
localization, which improves protection against lentiviral infection 
in non-human primates. A bnAb directed to the CD4-binding site 
of the HIV-1 envelope (Env) protein (denoted VRCO1)* was modi- 
fied by site-directed mutagenesis to increase its binding affinity for 
FcRn. This enhanced FcRn-binding mutant bnAb, denoted VRCO1- 
LS, displayed increased transcytosis across human FcRn-expressing 
cellular monolayers in vitro while retaining FcyRIIIa binding and 
function, including antibody-dependent cell-mediated cytotoxicity 
(ADCC) activity, at levels similar to VRCO1 (the wild type). VRCO1- 
LS had a threefold longer serum half-life than VRCO1 in non-human 
primates and persisted in the rectal mucosa even when it was no lon- 
ger detectable in the serum. Notably, VRCO1-LS mediated protection 
superior to that afforded by VRCO1 against intrarectal infection 
with simian—human immunodeficiency virus (SHIV). These findings 
suggest that modification of FcRn binding provides a mechanism not 
only to increase serum half-life but also to enhance mucosal local- 
ization that confers immune protection. Mutations that enhance FcRn 
function could therefore increase the potency and durability of pas- 
sive immunization strategies to prevent HIV-1 infection. 
Antibody protection against viral infection is influenced by effector 
functions mediated by the Fc domain of the antibody interacting with 
activating Fe receptors (including FcyRIIIa-mediated ADCC’%) or in- 
hibitory Fc receptors on the surface of immune cells’. FcRn is a multi- 
functional receptor that plays a role in IgG transport and homeostasis’. 
FcRn binds to the Fe portion of IgG with high affinity at an acidic pH 
(<6.5). Following endocytosis, free IgG is degraded in the lysosome, 
while FcRn-bound IgG avoids degradation at low pH and is recycled 
into the extracellular space, thus prolonging the half-life of IgG’. Re- 
cently, several monoclonal antibodies with broad and potent neutral- 
izing activity against diverse HIV-1 Env proteins have been shown to 
confer passive protection against SHIV infection**”°. To determine 
whether the protective efficacy of such antibodies can be increased by 
modulation of FcRn effector function, we evaluated the role of muta- 
tions that increase binding to FcRn. To potentiate the effector function 
of the bnAb VRCO1, we introduced mutations in the CH2 and/or CH3 
domains of VRCO1 and analysed the binding affinity of these mutated 
antibodies for human FcRn and FcyRIa. Five mutants known to en- 
hance binding to human FcRn (Fig. 1a), along with a non-FcRn-binding 


mutant (IHH)!*""*, were characterized in vitro. Similarly to VRCO1, all 
mutants bound toa previously described HIV-1 Env protein, RSC3 (which 
consists of a resurfaced, stabilized core’ of HIV-1 glycoprotein (gp) 120) 
(Fig. 1b); this finding is consistent with previously published reports of 
the effects of these FCRn-binding mutations on other IgGs"*. All mutants 
also had similar neutralization potency and breadth to VRCO1 (Ex- 
tended Data Table 1). These data confirm that the FcRn-binding muta- 
tions did not interfere with the recognition of cognate antigen or with 
neutralization. 

We also compared the ability of the FcCRn-binding mutants to bind 
to human FcRn at physiological or endosomal pH (7.4 and 6.0, respect- 
ively). As expected, all mutants except VRCO1-IHH bound more strongly 
to FcRn at either pH than did VRCO1 (Fig. 1c). While the binding of the 
Fc mutants to human FcRn at pH 7.4 by enzyme-linked immunosor- 
bent assay (ELISA) was greater than that of VRCO1 at saturating anti- 
body concentrations (>10 jg ml '), differences in binding potency are 
best discriminated by using half-maximum binding concentrations (ECsp). 
Analysis of ECs values revealed greater binding by each of the FcRn- 
enabled mutants at pH 6.0 than at pH 7.4 (Extended Data Table 2), as 
expected and consistent with previous reports on the pH dependence 
of FcRn binding. It has been proposed that higher affinity binding to 
FcRn at pH 7.4 might inhibit the release of FCRn-bound IgG’’. The 
FcRn-binding mutants of VRCO1 were dissociated at pH 7.4 similar to 
VRCO1 (Extended Data Fig. 1), indicating that VRCO1 FcRn-binding 
mutations enhanced the pH-dependent binding but did not affect re- 
lease at physiological pH. 

FcRn affects the transport of IgG from the basolateral to the apical 
surface of mucosal epithelial cells’. We therefore compared the trans- 
port of VRCO1 and its FcRn-binding mutants across MDCK (Madin- 
Darby canine kidney) cells that express human FcRn and B-microglobulin 
in a transwell system in vitro. When the concentration of monoclonal 
antibodies in the upper (apical) chamber was measured 2 h after anti- 
body addition to the lower (basolateral) chamber, the transcytosis of 
all mutants with enhanced FcRn binding was at least 2.5-fold higher 
than that of VRCO1. The highest level of transcytosis was exhibited by 
the VRCO1-LS and VRCO1-YTE mutants (Fig. 1d). These results show 
that mutations which enhance FcRn binding increase the transport of 
VRCO01 ina functional cell culture system. To confirm this effect in vivo, 
we evaluated the pharmacokinetics of these antibody mutants in human 
FcRn-transgenic mice’ (n = 4 per group). Similar to the cell culture 
results, all mutants with enhanced binding to human FcRn had a longer 
half-life than VRCO1 (Extended Data Fig. 2). 

In addition to virus neutralization, another Fc effector function has 
been implicated in immune protection. ADCC, which is mediated by 
IgG binding to FcyRIIIa, can lyse infected cells*’’. Because enhanced 
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Figure 1 | In vitro evaluation of VRCO1 and its FcRn-binding mutants. 


a, The FcRn-binding mutations used in this study are shown. The IHH mutant 
does not bind to human FcRn, and the other mutations enhance binding to 
human FcRn. b, Binding to HIV-1 Env (RSC3, a resurfaced, stabilized core of 
HIV-1 gp120) was examined by ELISA. c, Binding to human FcRn was assessed 
at pH6.0 and pH 7.4 by ELISA. d, Transcytosis was assessed by adding 

1.0 1g ml’ of each monoclonal antibody (mAb) to the basolateral side of a 
monolayer of MDCK cells that express human FcRn, which were grown on a 
transwell filter plate, and incubating for 2 h. The concentration of each antibody 
on the apical side was measured by quantitative RSC3 ELISA, and the fold 
increase in antibody concentration over VRCO1 (WT) is shown. *, P< 0.001 
versus VRCO1, two-tailed Student’s t-test. e, Binding to human FcyRIIIa was 
examined by ELISA. f, The ability of each monoclonal antibody to induce 
ADCC was analysed using human peripheral blood mononuclear cells as 
effector cells and HIV-1-infected CEM-NK* cells as targets and staining with 
the fluorescent dyes PKH26 and carboxyfluorescein succinimidy] ester (CFSE). 
The percentage killing was calculated from the percentage of CFSE cells 
within the PKH26" population. Assays were performed in duplicate 

(b, ¢, e, f) or quadruplicate (d), and the data points represent the mean values 
(b, c, e, f) or the mean + s.e.m. (d). The data are representative of two 
independent experiments (b-f). A459, absorbance at 450 nm. 


FcRn-binding mutations might affect the interactions of IgG with FcyRIa 
and thereby alter ADCC activity’, we assessed the FcyRIIIa binding 
and ADCC effector function of each mutant. We performed ELISA 
binding assays with human FcyRIIIa and ADCC assays using human 
peripheral blood mononuclear cells as effector cells and HIV-infected 
CEM-NK®* cells (a natural killer (NK)-cell-resistant human T leukaemia 
cell line) as targets”. Most of the enhanced FcRn-binding mutants showed 
lower FcyRIIa binding and ADCC activity than VRCO1. Interestingly, 
only the VRCO1-LS mutant exhibited both FcyRII]a binding and ADCC 
activity comparable to VRCO1 (Fig. le, f). We further examined the bind- 
ing affinities of VRCO1-LS for human FcRn and FcyRIIla using surface 
plasmon resonance analysis. Consistent with the ELISA results, VRCO1- 
LS exhibited a 12-fold higher human FcRn binding affinity than VRCO1, 
whereas both displayed similar binding to human FcyRIIla (Table 1). 
These data suggested that the increased FcRn binding of VRCO1-LS 
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Table 1 | Comparison of binding affinities for human FcRn and 
FcyRilla 


Binding to bnAb Kon (1/nM Xs) Kost (1/5) Ky (nM) Fold Ky* 
Human FcRn (pH 6.0) WT 0.000094 0.2790 3,000 1.00 
LS 0.000223 0.0558 250 12.00 
IHH NB NB NB = 
Human FcyRIlla WT 0.000080 0.105 1,300 1.00 
Ls 0.000099 0.100 1,010 1.29 
IHH 0.000041 0.187 4,500 0.29 


* Fold change in Kg versus wild type (WT). 
WT, VRCO1; LS, VRCO1-LS; IHH, VRCO1-IHH (non-FeRn-binding mutant); Kon, association rate 
constant; Ko, dissociation rate constant; Kg, equilibrium dissociation constant; NB, not bound. 


did not affect its FcyRIIIa-mediated effector function. We further ex- 
amined the binding of these VRCO1 variants to two other functional 
IgG receptors, FcyRIIa and FcyRIIb. We found no difference in the 
binding of VRCO1 and VRCO1-LS to either receptor, as determined by 
ELISA (Extended Data Fig. 3), suggesting that VRCO1-LS does not differ 
from VRCO1 with respect to ADCC effector function or immune sup- 
pression through FcyRIa or FcyRIIb, respectively. 

Because its ADCC activity was not affected by the FcRn-binding mu- 
tation, we selected the VRCO1-LS mutant for further analysis in non- 
human primates. Its half-life in vivo was evaluated by injection (10 mg 
per kg body weight) and compared with VRCO1, and VRCO1-IHH, in 
Indian rhesus macaques (Macaca mulatta). The serum concentrations 
of each monoclonal antibody were measured by ELISA, and the half-lives 
were calculated using a two-compartment model (Fig. 2a). The LS mutant 
had a 2.5-fold longer half-life (VRCO1, 4.65 days; VRCO1-LS, 11.80 days) 
and a 2.5-fold slower clearance rate than the wild type (VRCO01, 16.52 ml 
day 'kg~*; VRCO1-LS, 6.47 ml day ‘kg '). By contrast, VRCO1-IHH 
displayed markedly less persistence than the wild type, with a half-life 
of <1 day. This result confirmed that the increase in FcRn binding of 
VRCO1-LS prolonged the antibody’s half-life in non-human primates. 

Since the gut mucosa is a primary site of HIV infection, we examined 
the accumulation of each monoclonal antibody in the rectal mucosa. 
Because direct tissue immunohistochemical staining is subject to sam- 
pling variation, we used a more quantitative approach: we quantified 
the total amount of VRCO1 or VRCO1-LS in rectal tissue homogenates, 
by using ELISA. Protein was extracted from rectal biopsy samples, and 
the amount of each monoclonal antibody was normalized to the total 
amount of tissue protein. After a single injection (10 mgkg'), VRCO1- 
LS remained detectable in rectal tissue for more than 70 days and per- 
sisted at significantly higher levels than VRCO1 (P < 0.001, two-tailed 
t-test), which was not detected after 28 days (Fig. 2b). VRC01-IHH was 
not detected in rectal tissues after 14 days. At lower concentrations of 
monoclonal antibody (0.3 mgkg '), a similar trend was seen in both 
the serum and the rectal mucosa. The levels of VRCO1 in rectal tissue 
peaked at day 2, whereas VRCO1-LS continued to increase up to day 5, 
after which it persisted at higher levels than VRCO1 (Fig. 3a). In parallel 
studies, we also compared the relative monoclonal antibody levels in 
rectal secretions and tissue in cynomolgus macaques (Macaca fascicu- 
laris). Similarly to rhesus macaques, VRCO1-LS circulated for threefold 
longer in the serum than did VRCO1 (Extended Data Fig. 4a). Compared 
with VRCO1, higher levels of VRCO1-LS were measured in both rectal 
secretions and tissue at 28 days post antibody infusion, although the 
rectal tissue sampling was more consistent than the secretions (Extended 
Data Fig. 4b, c). We therefore suggest that tissue IgG provides an impor- 
tant indicator of mucosal IgG localization, consistent with a previous 
report”. 

To determine whether VRCO1 co-localized with FcRn in vivo, rectal 
biopsy samples taken after a single injection were stained either with a 
polyclonal antibody specific for human FcRn or with an HIV-1 Env probe, 
RSC3 (ref. 3). FcRn co-localized with VRCO1 mainly in mucosal epi- 
thelial cells (Fig. 2c). Together, these data demonstrated that VRCO1- 
LS persisted longer in the serum than VRCO1 and accumulated in the 
mucosa as a result of its increased binding affinity for FcRn. 
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Figure 2 E Pharmacokinetic Gig in Indian rhesus macaques. a, The 
indicated VRCO1 variant monoclonal antibodies were injected intravenously at 
10 mg per kg body weight into rhesus macaques on day 0 (VRCO1 and VRCO1- 
LS, n = 4; VRCO1-IHH, n = 2), and the serum levels over time are shown. 
The pharmacokinetic parameters were calculated with a two-compartment 
model (VRCO1 versus VRCO1-LS, half-life, 4.7 versus 11.8 days; clearance, 16.5 
versus 6.5 ml day * kg; area under the curve (AUC), 632 versus 1,645 day 
X pgml~'). b, The amount of monoclonal antibody per mg of total tissue 
protein in rectal biopsy samples from each monoclonal-antibody-injected 
rhesus macaque (n = 4 per group) was quantitated and is shown over time. 
VRCO1-LS persisted at higher levels than did VRCO1 (P < 0.001, two-tailed 
t-test). The data points are the mean + s.e.m. c, Immunohistochemical staining 
of the rectal biopsy samples (n = 6 at 10 or 20mgkg ' dose) after VRCO1 
administration shows co-localization of VRCO1, as measured by indirect 
binding to an Env probe, RSC3 (VRCO1, red; top) and FcRn, stained with an 
anti-human FcRn polyclonal antibody (FcRn, red; bottom). The sections 
shown are representative of those observed in the six macaques. The adjacent 
sections were stained with RSC3 and the anti-human FcRn polyclonal antibody 
in test samples. Buffer without RSC3 and control rabbit IgG were used as 
negative controls for VRCO1 and FcRn staining, respectively. The arrows 
indicate VRCO1 or FcRn staining in the cytoplasm of epithelial cells. The images 
were captured at X20 original magnification, except for the right column, 
which shows magnified views of the centre column (X40 original 
magnification). 


To determine whether VRCO1-LS can protect against infection more 
effectively than VRCO1, we performed an intrarectal challenge with the 
CC-chemokine receptor 5 (CCR5)-tropic SHIV BaLP4. Rhesus maca- 
ques (n = 12 per group) were injected intravenously with a low, sub- 
optimally protective, dose (0.3 mgkg” ') of VRC01, VRCO1-LS or control 
IgG and challenged intrarectally with SHIV 5 days after passive anti- 
body transfer. All control-IgG-injected macaques became infected within 
3 weeks of SHIV challenge, as measured by plasma viraemia. Ten of 12 
macaques that received VRCO1 were infected, in contrast to 5 of 12 
VRCO1-LS-injected macaques (P = 0.0447, one-tailed Fisher’s exact 
test) (Fig. 3b and Extended Data Fig. 5). Among the infected animals, 
no significant differences in peak or set-point viraemia were observed 
between these experimental groups. VRCO1-LS therefore conferred su- 
perior protection to VRCO1 against acquisition of lentiviral infection. 

In this study, we show that modification of a monoclonal antibody- 
FcRn interaction through mutations in the Fc domain can increase the 
protection afforded by the antibody against mucosal lentiviral challenge. 
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Figure 3 | VRCO1-LS affords greater protection against intrarectal SHIV 
BaLP4 challenge than VRCO1. The indicated monoclonal antibodies were 
administered intravenously to rhesus macaques at a dose of 0.3 mgkg | on 
day 0. a, Monoclonal antibody levels per mg of total tissue protein in rectal 
biopsy samples (n = 4 per group) are shown over time. The data points are 
mean + s.e.m. b, The macaques (n = 12 per group) were challenged with 
SHIV BaLP4 on day 5 after monoclonal antibody administration, and the 
percentage of uninfected macaques was plotted over time. Normal human IgG 
was given to the control group. Five of 12 VRC01-LS-treated macaques and 
10 of 12 VRCO1-treated macaques were infected (P = 0.0447, one-tailed 
Fisher’s exact test). 


Passive infusion of anti-HIV-1 bnAbs has been shown to prevent mu- 
cosal SHIV infection”, and the pharmacokinetics and biodistribution 
of the bnAbs play important roles in this protection’'; however, the 
mechanism that confers mucosal protection has not been defined. Spe- 
cifically, whether antibody transport and FcRn effector function con- 
tribute to mucosal immunity in primates was previously unknown. Our 
work demonstrates that FcRn plays an important role in IgG transport 
and provides insight into the mechanism by which FcRn function facil- 
itates protection by IgG against mucosal viral infection. The increased 
FcRn binding of IgG induced by the LS mutation avoids endosomal 
degradation, allowing recycling into the serum, thereby increasing the 
serum antibody levels. The antibody retention in the rectal mucosa is 
probably due to antibody sequestration in the endosomes of epithelial 
cells (as indicated by the intracellular localization, Fig. 2c), which is pre- 
sumably followed either by recycling into the subepithelial mucosal 
tissue for interactions with professional antigen presenting cells or by 
transcytosis into the intestinal lumen. Importantly, this mutation also 
increased the retention of IgG in the intestinal mucosa at the portal of 
HIV-1 entry, where a higher local concentration can more effectively 
neutralize local infection. 

Our results suggest that active, FCRn-mediated transport, as opposed 
to simple diffusion, is a major component of IgG compartmentaliza- 
tion in the mucosal tissues of the rectum. FcRn-mediated transport of 
IgG to mucosal surfaces may play an unrecognized role in the ability of 
a variety of vaccines to successfully block mucosal viral infections. For 
example, vaccination with inactive poliovirus (also known as the Salk 
vaccine) generates an IgG response that can block poliovirus infection 
in the gut mucosa”. There are similarities, as well as potential differ- 
ences, between poliovirus and HIV-1 pathogenesis. Both poliovirus and 
HIV-1 invade the human body through mucosal tissues, especially the 
intestinal mucosa, but the receptors for cell entry are different. Polio- 
virus penetrates the body through M cells, in the Peyer’s patches™, where- 
as HIV-1 targets intraepithelial CD4* T cells in the mucosal epithelium. 
Although FcRn mediates the transcytosis of IgG across the intestinal 
epithelium in vivo as has been shown in a humanized mouse model”, 
our results demonstrate that FcRn is also a means to retain IgG in tis- 
sues. This latter effect may restrain the quantities of IgG that are trans- 
ported into the lumen and potentially explains why previous studies 
have failed to find an association between monoclonal antibody levels 
in mucosal secretions and protection”. Macaques with higher levels of 
bnAbs (VRCO1-LS) in rectal tissues had a lower rate of infection than 
macaques injected with VRCO1 (Fig. 3). Thus, it is important to mea- 
sure IgG in tissues, in addition to in mucosal secretions. 
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The sexual transmission of HIV-1 is initiated by the replication of one 
ora small number of virions that penetrate the vaginal or rectal mucosa”®. 
The FcRn-mediated anatomical localization of bnAbs to the rectal mu- 
cosa may more effectively block penetration at this site of entry. This 
localization, combined with the quantitative effects of FCRn on serum 
IgG levels, explains how increasing the affinity for FCRn improves bnAb 
efficacy. Notably, FcRn expression has also been reported in the female 
genital tract?”: FCRn knockout in this previous paper was associated 
with decreased IgG-mediated protection against intravaginal herpes sim- 
plex virus (HSV-2) challenge. Because the FcRn knockout effects are 
global, it is unclear whether this effect was mediated through a systemic 
or mucosal effect; nonetheless, together, these studies are complement- 
ary and support the idea that FCRn mediates mucosal protection by IgG. 
The potential role of FcRn at other mucosal sites is further supported 
by the observation that VRCO1 accumulates in vaginal tissues and lo- 
calizes to the basal and parabasal layers of the vaginal epithelium, simi- 
larly to FcRn, as determined by immunohistochemistry (Extended Data 
Fig. 6). Presumably, FcRn-mediated transport increases the monoclo- 
nal antibody levels in this tissue, where it might similarly confer pro- 
tection against infection. 

Altering the pharmacokinetic profile of antibodies by increasing half- 
life and tissue localization has practical implications for the use of pas- 
sive monoclonal antibody administration for immunoprophylaxis and 
therapy. One concern about the use of such monoclonal antibody muta- 
tions is whether they might elicit immune responses to the FcRn-binding 
mutation. We found no evidence of increased antibody responses to 
VRCO1-LS or specifically to the LS mutation relative to VRCO1 (Ex- 
tended Data Fig. 7). Because VRCO1 is human and xenogeneic to non- 
human primates, such questions about immunogenicity must eventually 
be evaluated in humans. In this regard, at least one human monoclonal 
antibody with an FcRn-binding-site mutation (motavizumab-YTE; 
M252Y / S254T / 'T256E) has been tested in healthy human adults and 
did not induce an appreciable increase in an anti-drug antibody res- 
ponse compared with the wild-type antibody”. 

Potent bnAbs with increased FcRn binding can be retained in rel- 
evant tissues for extended periods, decreasing the necessary amount of 
antibody, as well as the frequency of injections, thereby making the treat- 
ment or prevention of viral infections by passive antibody transfer more 
feasible than at present and broadly applicable. In addition to its poten- 
tial importance for the optimization of passive immune prevention of 
HIV-1 infection of humans, this mechanism is also likely to be relevant 
to antibody-mediated protection against other mucosal pathogens such 
as rotavirus, poliovirus, norovirus and influenza virus. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Generation and characterization of VRCO1 Fc mutants. The Fc mutations used 
in this study are indicated in Fig 1a. The amino acid positions are numbered ac- 
cording to the EU index of human IgGl (ref. 30). Mutations were introduced to the 
heavy chain gene of VRCO1 by site-directed mutagenesis using the QuikChange II 
Site-Directed Mutagenesis Kit (Agilent Technologies). Mammalian expression plas- 
mids encoding the heavy and light chain genes were transiently co-transfected into 
the human embryonic kidney cell line 293F, and the supernatant was harvested 5 days 
later. The monoclonal antibodies were purified from the supernatant by Protein G 
column chromatography (GE Healthcare). 

The glycosylation patterns of VRCO1 and VRCO1-LS were analysed by using 

nanoLC-ESI-MS/MS (nano liquid chromatography electrospray ionization mass 
spectrometry) peptide sequencing technology. In brief, each solution sample was 
cleaned and digested in-gel with sequencing grade modified trypsin (Promega). The 
resultant peptide mixture was analysed by an LC-ESI-MS/MS system, in which high 
performance liquid chromatography (HPLC) with a 75-,1m (inner diameter) reverse 
phase C18 column was coupled in-line with an ion trap mass spectrometer (Thermo 
Scientific). Comparing the glycosylation patterns of VRCO1 and VRCO1-LS by mass 
spectrometry, we focused on the glycan at amino acid position N297, which is the 
single N-glycosylation site in the Fc region of IgG1 isotype antibodies (such as VRCO1). 
A very similar pattern of complex glycan configuration was noted for both mono- 
clonal antibodies (Extended Data Table 3). G1F (GalGlcNAc,Man3;GlcNAc,Fuc) 
and GOF (GlcNAc,Man3GlcNAcFuc) are the major glycosylation species found in 
both VRCO1 and VRCO1-LS. While quantitative differences were noted in some 
glycan species, these differences represent small changes and fall within the limits 
of the expected experimental variability between preparations. More importantly, 
the LS mutation neither generated nor removed additional potential N-linked gly- 
cosylation patterns observed in VRCO1. 
ELISA. RSC3 ELISAs were used to analyse IgG binding. ELISA plates (96 well) were 
coated with 2 pg ml” * RSC3 in PBS, incubated at 4 °C overnight and blocked with 
PBS containing 5% BSA at room temperature for 1 h. Serial dilutions of each mono- 
clonal antibody in 2.5% BSA in PBS were added to the plates and incubated at room 
temperature for 1h. Horseradish peroxidase (HRP)-conjugated anti-human IgG, 
Fcy-specific (Jackson ImmunoResearch Laboratories), was added and incubated at 
room temperature for 1 h. Tetramethylbenzidine (TMB, Sigma) HRP substrate was 
added to each well, and the absorbance of the yellow colour that developed after 
adding 0.5 M H2SO, was measured at 450 nm. 

The binding of IgG to human FcRn or human Fcy receptors was examined as 
follows. His-tagged human FcRn was purified as previously described*’, and His- 
tagged human FcyRIIIa, FeyRIa and FcyRIIb were purchased from R&D Systems. 
Nickel-coated plates (96 well, Thermo Scientific) were coated with 2 1g ml! His- 
tagged human FcRn or human Fcy receptor and incubated at room temperature 
for 1h. The ELISA was performed as described above. To examine the binding of 
monoclonal antibodies to human FcRn at pH 6.0, PBS at pH 6.0 was used instead 
of PBS at pH 7.4. ECs9 determination for human FcRn binding was calculated by 
computer-assisted nonlinear fitting using GraphPad Prism (Graphpad Software). 

VRCO1 and its mutants were quantified as follows. ELISA plates (96 well) were 
coated with 2 pg ml * RSC3 in PBS, incubated at 4 °C overnight and blocked with 
PBS containing 5% BSA at room temperature for 1h. Sera were diluted in 2.5% 
BSA in PBS and added to the plate and incubated at room temperature for 1 h, and 
the ELISA was carried out as described above. Purified monoclonal antibodies were 
used as a standard, and the limit of detection was 0.4ng ml}. 

Anti-VRCO01 antibody responses were evaluated as follows. The sera from ma- 
caques that had been administered VRCO1 or VRCO1-LS were added to 96-well 
ELISA plates coated with 2 1g ml” ' VRCO1 or VRCO1-LS. To test VRCO1-LS -specific 
antibody responses, the sera from the two groups were absorbed with 20 pg ml"! 
VRCO1, and the ELISA was carried out against VRCO1-LS. The sera were diluted 
with TBS containing 5% skim milk, 2% BSA and 0.05% Tween 20, and the ELISA 
was performed as described above. 

Neutralization assays. Antibody-mediated neutralization of pseudotyped HIV-1 
isolates was measured using the TZM-bl luciferase reporter gene assay as described 
previously’? ™*. The target baseline infection level was ~200,000 relative light units. 
Neutralization curves were fit using a five-parameter hill-slope equation as described 
previously**. The concentrations of VRCO1 and its mutants that were required to 
inhibit infection by 50% or 80% are reported as 50% or 80% inhibitory concentra- 
tions (ICso or ICgo), respectively. 

Biacore binding assays. For surface plasmon resonance analysis, human FcRn and 
human FcyRIIa (5 tg ml’ in 10 mM acetate, pH 5.5, for 10 min ata 10 pl min’! 
flow rate) were immobilized on a Biacore CM5 Sensor Chip to an immobilization 
level of ~500 response units (RU). VRCO1 and its Fc mutants were injected for 2 min 
at various concentrations (0.01-10 1M) and allowed to dissociate for 3 min at a 
50 pl min“ ' flow rate. The human FcRn surface was regenerated by two sequential 
injections of 1X PBS (pH 7.4) for 1 minatan 80 pl min flow rate, while the human 


FcyRIIIa surface was regenerated with two sequential injections of 10 mM glycine 
(pH 2.1). All binding affinities were calculated using a 1:1 Langmuir fitting equa- 
tion and Scrubber 2 (BioLogic Software), with an average value determined over 
the range of concentrations measured. 

Animals. mFcRn~/~ hFcRn(276)'®78 mice were obtained from Jackson Labora- 
tories (stock number 004919). They were housed and bred in the animal facility of 
the Vaccine Research Center (VRC) at the National Institute of Allergy and Infec- 
tious Diseases. Macaca mulatta animals of Indian origin and Macaca fascicularis 
were used in the non-human primate study. All animal experiments were reviewed 
and approved by the Animal Care and Use Committee of the VRC, and all animals 
were housed and cared for in accordance with local, state, federal and institutional 
policies in a facility accredited by the American Association for Accreditation of 
Laboratory Animal Care at the National Institutes of Health. Animal numbers per 
group in this study were chosen in accordance with previously published papers'*’*"*. 
Animals were chosen and randomized based on age and weight. 

ADCC assay. ADCC assays were performed using a fluorescence activated cell 
sorting (FACS)-based method, as previously described”°. Briefly, HIV-IIIB-infected 
CEM NK®-CCRS cells were used as target cells and human peripheral blood mono- 
nuclear cells were used as effector cells. Target cells were double-stained with the 
fluorescent dyes PKH26 (Sigma) and carboxyfluorescein succinimidyl ester (CFSE; 
Invitrogen). Target cells (10,000) and serially diluted monoclonal antibody were 
added to each well of a 96-well plate, mixed thoroughly and incubated at room tem- 
perature for 15 min. Effector cells (500,000) were added to each well and mixed 
thoroughly. The plate was centrifuged for 3 min at 400 r.p.m. and incubated at 37 °C 
in a5% CO incubator for 4h. The cells were washed with PBS, fixed with 1% para- 
formaldehyde in PBS, and stored at 4 °C overnight. The percentage of CFSE cells 
within the PKH26™ population was obtained by using an LSR-II flow cytometer 
(BD Biosciences) and FlowJo software (TreeStar). Non-stained and single-stained 
targets were included in every experiment to compensate for single-stained CFSE 
and PKH26 emissions. 

Transcytosis assay. Assessments of transcytosis of VRCO1 and its Fc mutants were 
performed as previously described for IgG*’. Briefly, MDCK cells were transfected 
to express human B,-microglobulin and human FcRn or vector controls and were 
grown to confluence in DMEM containing 10% FBS and 1% penicillin and strep- 
tomycin on 12-mm transwells with a pore size of 0.4 tm (Corning) and allowed to 
polarize for 96 h. Twelve hours before the commencement of transcytosis experi- 
ments, the medium was changed to serum-free medium without antibiotics. On 
the day of the experiment, the measured transepithelial resistance ranged between 
150 and 200 Qcm’. Transwells were incubated in Hank’s balanced salt solution 
(HBSS), pH 7.4, for 20 min followed by equilibration at 36 °C and 5% CO, ina 12- 
well plate containing HBSS, pH 6.0, in the input chamber and HBSS, pH 7.4, in the 
output chamber. pH 6.0-adjusted VRCO1 variants at varying concentrations in HBSS 
were then directly added to the input chamber. After incubating for 2h at 36 °C 
and 5% COs, the medium in the output chamber was collected, and the mono- 
clonal antibody concentrations were quantified by ELISA. Mycoplasma-free stocks 
of transfected MDCK cells using the Venor GeM Mycoplasma Detection Kit 
(Sigma, MP0025) were maintained as aliquots and regularly thawed for use every 
3-4 weeks. 

Pharmacokinetic experiments in rhesus macaques. Female Indian rhesus maca- 
ques weighing between 2.9 and 6.7 kg were randomly assigned to groups according 
to body weight (four macaques per group for VRCO1 and VRCO1-LS, and two ma- 
caques per group for VRCO1-IHH) and were intravenously injected with 10 mg kg * 
monoclonal antibody. Blood was collected before injection on day 0, at 30 min and 
6h after injection, and at 1, 2, 4, 7, 14, 21 and 28 days after injection. The concen- 
tration of each monoclonal antibody was quantified by ELISA. Pharmacokinetic 
parameters were calculated with a two-compartment model using the WinNonlin 
software (Pharsight). 

Immunohistochemical staining. All staining procedures were performed as prev- 
iously described”, using 5-l1m tissue sections mounted on glass slides. Tissues were 
deparaffinized and rehydrated in deionized water. Heat-induced epitope retrieval 
was performed using a water bath (98 °C) in EDTA Decloaker (Biocare Medical), 
followed by cooling to room temperature. Tissue sections were then blocked with 
Sniper blocking reagent (Biocare Medical) for 30 min at room temperature. RSC3- 
biotin was diluted at 1:100 in TNB (0.1 M Tris-HCl, pH 7.5, 0.1 M NaCland 0.05% 
Tween 20 with DuPont Blocking Reagent) and incubated overnight at 4 °C. After 
the RSC3-biotin incubation, the sections were washed with PBS containing 0.1% 
Tween 20 and then incubated with anti-biotin antibody conjugated to alkaline phos- 
phatase in TNB for 2 h at room temperature. After the incubation with anti-biotin 
antibody, the sections were washed with PBS containing 0.1% Tween 20. Signal was 
detected with the Warp Red Chromogen Kit (Biocare Medical). The sections were 
counterstained with Harris Hematoxylin (Surgipath), dehydrated rapidly in a gradient 
of ethanols and mounted with Permount (Fisher Scientific). Stained sections were 
examined by light microscopy at ambient temperatures. Light micrographs were 
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taken using an Olympus BX60 upright microscope with the following objectives: 
X10 (0.3 NA), X20 (0.5 NA) and X40 (0.75 NA); images were acquired using a SPOT 
colour mosaic camera (model 11.2; Diagnostic Instruments) and SPOT acquisition 
software (version 4.5.9; Diagnostic Instruments). Anti-biotin antibody conjugated 
to alkaline phosphatase was used as a negative control antibody in all instances and 
yielded negative staining results. For staining of human FcRn, heat-induced epi- 
tope retrieval was performed in Diva Decloaker (Biocare Medical) at 96 °C for 20 min, 
and polyclonal anti-human-FcRn antibody(1:50 dilution; Sigma, HPA012122) and 
anti-rabbit IgG conjugated to alkaline phosphatase were used as primary and sec- 
ondary antibodies, respectively. 

Quantitation of monoclonal antibodies in rectal biopsies. As previously described”, 
rectal biopsy samples were added to 1.5-ml tubes containing 100 pl PBS with EDTA- 
free protease inhibitor (Roche). The samples were ground on ice for 30 s and centri- 
fuged at 4 °C for 20 min at 12,000 r.p.m., and the supernatants were collected. The 
total protein amounts were quantified by a BCA Protein Assay Kit (Thermo Sci- 
entific), and VRCO1 and its Fc mutants were quantified by ELISA. The monoclonal 
antibody amount per mg of total protein is shown. The trends over days 2-28 were 
compared between groups by assuming a linear model for each animal and then 
testing the parameters from the linear models for differences between groups using 
a two-tailed t-test. 

Challenge experiment in rhesus macaques. Male Indian rhesus macaques weigh- 
ing 3.2-4.8 kg were randomly assigned to groups according to body weight, intra- 
venously injected with 0.3mgkg~' each monoclonal antibody and intrarectally 
challenged with undiluted SHIV BaLP4 5 days after the monoclonal antibody injec- 
tion. Blood was collected twice a week for up to 4 weeks and then once a week for up 
to 12 weeks. Plasma was isolated from the blood samples. The plasma viral RNA 
levels were determined using a modified two-step quantitative reverse transcription 
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PCR process. Experimental samples were run in parallel with a simian immuno- 
deficiency virus (SIV) gag RNA standard on an Applied Biosystems StepOne Real- 
Time PCR System. The lower limit of detection using this assay was 250 copies of 
SIV RNA ml '. The forward primer used was 5'-GTC TGC GTC ATC TGG ATT 
C-3', and the reverse primer was 5’-CAC TAG GTG TCT CTG CAC TAT CTG 
TTT TG-3’. 
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Extended Data Figure 1 | Enhanced FcRn-binding mutants of VRCO1 are 
released from human FcRn at pH 7.4. VRCO1 or the indicated FcRn-binding 
mutants were injected in PBS (pH 6.0) at a concentration of 100 nM over 
human-FcRn-immobilized Biacore CM5 sensor chips (~500 response units 
(RU)) and were dissociated using PBS at pH 6.0 followed by PBS at pH 7.4. 
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Extended Data Figure 2 | Mutants with enhanced human FcRn binding 
persist at higher serum concentrations than VRCO1 in the human- 
FcRn(276)-transgenic mouse. The indicated VRC0O1-derived monoclonal 
antibodies (2 mg kg” ') were injected intravenously into 6-8-week-old 
human-FcRn(276)-transgenic mice (n = 4, male and female mice evenly 
distributed). Serum concentrations were assessed by indirect ELISA against 
RSC3 over time. The percentage of the monoclonal antibodies remaining in the 
serum is shown compared with the percentage on day 1 (set at 100%). 
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Extended Data Figure 3 | Binding of VRCO1, VRCO1-LS and other human Fc receptors was evaluated by ELISA as described in 


FcRn-binding mutants to FcyRIIa and FcyRIIb. Binding to the indicated Methods. 
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Extended Data Figure 4 | Serum pharmacokinetics, rectal tissue 
accumulation and mucosal secretion of VRC01-derived monoclonal 
antibodies in cynomolgus macaques. VRCO1 or VRCO1-LS (10 mgkg  *) 
were injected intravenously into cynomolgus macaques. a, The serum levels 
(n = 4) were analysed over time. b, c, The amounts of monoclonal antibody per 
mg of total tissue protein in rectal biopsy samples (b) and per mg of total IgG in 
rectal secretions (c) from each monoclonal-antibody-injected cynomolgus 
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macaque (n = 2 per group) were quantitated and are shown over time. The 
values for each macaque and the mean values for the groups are shown as 
dotted lines and heavier solid lines, respectively (b). Pharmacokinetic 
parameters were calculated with a two-compartment model (VRCO1 versus 
VRCO1-LS, half-life, 9.0 versus 30.3 days; clearance, 15.7 versus 

3.7 ml day ' kg; area under the curve (AUC), 896 versus 

2,812 day X pg ml’). 
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Extended Data Figure 5 | Viral load measurements over time after passive | SHIV BaLP45 days later, and the plasma viral loads were measured over time. 


antibody transfer and SHIV challenge of non-human primates. The Normal human IgG was given to the control group. Five of 12 VRC01-LS- 
indicated monoclonal antibodies (0.3 mg kg ') were administered injected macaques and 10 of 12 VRCO1-injected macaques were infected 
intravenously to rhesus macaques (n = 12). Macaques were challenged with (P = 0.0447, one-tailed Fisher’s exact test). 
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Extended Data Figure 6 | Accumulation and localization of VRCO1 in b, VRCO1 (20 mg kg ') was injected intravenously into rhesus macaques 
vaginal tissue. a, VRCO1 or VRCO1-LS (10 mgkg’ ') were injected (n = 2), and vaginal biopsy samples were taken and processed. 
intravenously into female cynomolgus macaques (n = 2). The monoclonal Immunohistochemical staining was performed before and after the antibody 


antibody concentration per mg of total tissue protein in vaginal biopsy samples _ dosing. The arrows indicate VRCO1 (RSC3 staining, red) in basal and parabasal 
was quantitated at the indicated times. The values for each macaque andthe __ epithelial cells. The sections are representative of the sections assessed for 
mean values for the groups are shown as dotted and solid lines, respectively. | two macaques. Scale bar, 50 jm. 
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VRCO1-LS in non-human primates. VRC01 or VRCO1-LS (10 mgkg~') was | VRCO1-LS were absorbed with 20 pg ml’ VRCO1, and binding to VRC01-LS 
injected intravenously into Indian rhesus macaques (n = 4). a, Anti- VRCO1 was then evaluated by ELISA. 
antibody responses (left) or anti-VRC01-LS antibody responses (right) were 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 1 | Comparative potency of HIV-1 neutralization by VRCO1 and enhanced FcRn-binding mutants 


ICgo (ug/mL)* 


WT LS QL A AAA YTE 
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$lVmac251.30 >50 >50 >50 >50 >50 >50 
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7165.18 
Du156.12 
ZM109.4 
SVA.MLV >50 >50 >50 >50 >50 >50 
$lVmac251.30 >50 >50 >50 >50 >50 >50 


* Values are colour coded according to their potency: white (>50 pg ml~*), orange (0.1-1.0 ng mI~?) and red (<0.1 pgm! 3). 
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Extended Data Table 2 | ECso values for the binding of VRCO1 and 
VRCO1-LS to human FcRn, as determined by ELISA 


EC50 (ug/mL) 
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pH7.4 
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Fold change 
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Extended Data Table 3 | Relative abundance of N297 glycosylation in VRCO1 and VRCO1-LS* 


Relative abundance (%) 
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eo 
GIcNAc2ManzGIcNAc2Fuc (GOF) finale. 87.6 
mo 


GalGIcNAc,Man3GIcNAc.Fuc (G1F) @~— < = Rea 12.4 
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Ls 


5.4 


23 


8.9 


* Glycosylation was analysed by nanoLC-ESI-MS/MS (nano liquid chromatography electrospray ionization mass spectrometry) peptide sequencing technology. 
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Protein machines are multi-subunit protein complexes that orches- 
trate highly regulated biochemical tasks. An example is the anaphase- 
promoting complex/cyclosome (APC/C), a 13-subunit ubiquitin 
ligase that initiates the metaphase-anaphase transition and mitotic 
exit by targeting proteins such as securin and cyclin B1 for ubiquitin- 
dependent destruction by the proteasome’. Because blocking mito- 
tic exit is an effective approach for inducing tumour cell death**, the 
APC/C represents a potential novel target for cancer therapy. APC/C 
activation in mitosis requires binding of Cdc20 (ref. 5), which forms 
a co-receptor with the APC/C to recognize substrates containing a 
destruction box (D-box)* *. Here we demonstrate that we can syner- 
gistically inhibit APC/C-dependent proteolysis and mitotic exit by 
simultaneously disrupting two protein-protein interactions with- 
in the APC/C-Cdc20-substrate ternary complex. We identify a small 
molecule, called apcin (APC inhibitor), which binds to Cdc20 and 
competitively inhibits the ubiquitylation of D-box-containing sub- 
strates. Analysis of the crystal structure of the apcin-Cdc20 complex 
suggests that apcin occupies the D-box-binding pocket on the side 
face of the WD40-domain. The ability of apcin to block mitotic exit 
is synergistically amplified by co-addition of tosyl-L-arginine methyl 
ester, a small molecule that blocks the APC/C-Cdc20 interaction’*"*. 
This work suggests that simultaneous disruption of multiple, weak 
protein-protein interactions is an effective approach for inactiva- 
ting a protein machine. 

We identified apcin (Fig. la) in an earlier study as an inhibitor of 
cyclin proteolysis in mitotic Xenopus egg extract’’, but its mechanism 
of action remained unknown. Analysis of the structure-activity rela- 
tionship (Fig. 1b and Extended Data Fig. 1a) revealed that elimination 
of one nitrogen in the pyrimidine ring of apcin (apcin-P) reduced ac- 
tivity slightly, but replacement with a morpholino group (apcin-M) elim- 
inated activity. In contrast, elimination of the nitro-imidazole moiety 
(apcin-A) had little effect. To identify the target of apcin, we coupled 
apcin-A to beads via its amino group, incubated the beads with mitotic 
Xenopus extract, then removed the beads. We found that Cdc20 was 
depleted from the extract (Fig. 1c), resulting in stabilization of a cyclin- 
B-luciferase reporter protein (Fig. 1d). Cyclin degradation was rescued 
by adding in vitro-translated Cdc20 (Fig. 1d), implicating Cdc20 as the 
target of apcin. Cdc20 binding to apcin-A beads could be competed by 
free apcin (Fig. le), but not by the inactive analogue apcin-M or the 
APC/C inhibitor tosyl-L-arginine methyl ester (TAME)'*’° (Extended 
Data Fig. 1b). A cyclin B1 amino (N)-terminal fragment (cycB1-NT) also 
competed for Cdc20 binding to the apcin-A resin, but the same frag- 
ment with a mutated D-box did not (Fig. 1f). Among a panel of WD40- 
containing proteins, Cdc20 binding to apcin-A beads was most robust, 
followed by Cdh1, with much less binding of other WD40-containing pro- 
teins observed (Extended Data Fig. 1c). Although we have not tested the 
ability of apcin to inhibit Cdh1-dependent ubiquitylation, apcin inhibited 


Cdh1-dependent proteolysis in interphase Xenopus extract less efficiently 
than Cdc20-dependent proteolysis in mitotic Xenopus extract (Extended 
Data Fig. 1d). Apcin bound to endogenous Cdc20 in Xenopus extract in 
a dose-dependent manner (Extended Data Fig. le) that correlated with its 
ability to inhibit formation of high-molecular-mass ubiquitin conjugates 
of cycB1-NT (Fig. 1g) or full-length cyclin B1 (Extended Data Fig. 2a). 
Kinetic analysis of a reconstituted APC/C-dependent ubiquitylation 
reaction’*® showed that apcin caused a significant increase (P = 0.0039) in 
the Michaelis constant, K,, (inhibition constant K, = 23 1M), but no reduc- 
tion in the catalytic rate constant k.., (Extended Data Fig. 2b). Together 
these results suggest that apcin competitively inhibits APC/C-dependent 
ubiquitylation by binding to Cdc20 and preventing substrate recognition. 

To identify the site on Cdc20 that binds apcin, we soaked apcin into 
Cdc20 protein crystals and determined the structure of the Cdce20-apcin 
complex to 2.1 A resolution (Extended Data Table 1). We found that 
apcin bound a small pocket on the side of the WD40 domain that has 
been implicated in binding the D-box (Fig. 2a, b and Extended Data 
Fig. 3a, b)'*'*"°. The binding mode of apcin is consistent with the 
structure-activity relationship, as the pyrimidine ring and aminal ni- 
trogens make hydrogen bonds with backbone atoms from D177. The 
hydrophobic trichloromethyl group is buried in the pocket occupied 
by leucine of the D-box (Extended Data Fig. 3b). The nitro-imidazole 
moiety is positioned facing solvent, explaining why apcin-A retains ac- 
tivity and can be used to isolate Cdc20 when coupled to beads. 

We tested whether mutations in the presumptive binding site af- 
fect the binding of Cdc20 to apcin-A beads (Fig. 2c and Extended Data 
Fig. 3c, d). V200 is located at the base of the hydrophobic pocket and 
mutation to methionine blocked binding of Cdc20 to apcin-A beads and 
inactivated the ability of Cdc20 to rescue cyclin degradation in a Cdc20- 
depleted extract. Mutation of D177, P179 or 1216, which line the binding 
pocket, also reduced binding to apcin and function of Cdc20. Mutation 
of R174, which lies near the pyrimidine ring of apcin, also reduced apcin 
binding and blocked Cdc20 rescue activity, consistent with a role of this 
residue in interacting with negatively charged amino acids in the D-box"™*. 
In contrast, mutation of E465, which interacts with the conserved argi- 
nine of the D-box™ but lies distant from the apcin-binding pocket, de- 
creased the ability of Cdc20 to rescue degradation, but had little effect 
onapcin binding. E180 lies further away and mutation to alanine had no 
effect on apcin binding or rescue activity. Overall, we observed a strong 
correlation between effects of mutations on apcin binding and their effects 
on Cdc20 function for residues that line the apcin-binding pocket. 

Cdc20 is recruited to the APC/C through multiple weak interactions”. 
For example, substrates can promote cooperative Cdc20 binding to 
the APC/C through a co-receptor interaction in which the substrate is 
simultaneously recognized by Cdc20 and the APC/C*™*. Consistent 
with this idea, we found that addition of substrate increased Cdc20 load- 
ing onto the APC/C in a concentration- and D-box-dependent manner 
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Figure 1 | Apcin binds to Cdc20 and inhibits APC/C-dependent 
ubiquitylation. a, Structures of apcin and derivatives. b, Effects of apcin and 
derivatives (200 1M) on proteolysis of an N-terminal fragment of cyclin B1 
(cycB1-NT) in mitotic Xenopus egg extract. Substrate levels were measured at 
40 min. N = 3 independent experiments. c, Apcin-A resin depletes Cdc20 
from mitotic Xenopus egg extract. Cdc20 and Cdc27 levels were measured by 
western blotting. d, Depletion with Apcin-A resin stabilizes a cyclin-luciferase 
reporter protein and degradation can be rescued by addition of in vitro- 
translated Cdc20. Substrate levels were measured at 60 min. N = 3 


in Xenopus extract (Fig. 3a and Extended Data Fig. 4a, b). Substrate- 
induced loading of Cdc20 could be blocked by addition of apcin (Fig. 3a 
and Extended Data Fig. 4a), indicating that the leucine-binding pocket 
of Cdc20 is critical for co-receptor function. Because the binding of 
Cdc20 to the APC/C was variable in the absence of added substrate, we 
were not able to assess the effects of apcin under this condition. Thus 
we cannot exclude the possibility that apcin might also decrease Cdc20 
binding to the APC/C in the absence of substrate. The small molecule 
TAME, which antagonizes the Ile-Arg (IR)-tail interaction between Cdc20 
and the APC/C'*"*, also antagonized Cdc20 loading. At high concen- 
trations of substrate, the combined use of apcin and TAME was more 
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independent experiments. e, Cdc20 expressed in reticulocyte lysate binds to 
apcin-A resin and can be competed by free apcin. Cdc20 was detected by 
western blotting. N = 3 independent experiments. f, Wild-type (WT) 
cycB1-NT, but not a D-box mutant (DBM, mutation of R42A, L45A), 
competes with Cdc20 binding to apcin-A resin. Cdc20 was detected by western 
blotting. N = 4 independent experiments. g, Apcin inhibits formation of 
high-molecular-mass ubiquitin conjugates of [°°S]cycB1-NT in mitotic 
Xenopus extract. Bars, mean + s.e.m. 


effective at blocking Cdc20 binding to the APC/C than either compound 
used alone, suggesting that simultaneous disruption of multiple inter- 
actions between substrate, Cdc20 and APC/C may be an effective strat- 
egy for inhibiting APC/C. 

Because substrates can be recruited to the APC/C through both D- 
box-dependent and -independent mechanisms, we compared the abil- 
ity of apcin to stabilize different APC/C substrates in mitotic Xenopus 
extract. Apcin stabilized cycB1-NT and securin most effectively (Fig. 3b 
and Extended Data Figs 4c and 5b), with somewhat weaker effects against 
full-length cyclin B1 (Fig. 3b and Extended Data Fig. 4c). Interestingly, 
even high concentrations of apcin failed to stabilize cyclin A2 or Nek2A 


Figure 2 | Apcin binds to the D-box binding site 
of Cdc20. a, Crystal structure of the apcin-Cdc20 
complex. Apcin atoms are labelled in yellow 
(carbon), blue (nitrogen), red (oxygen) and green 
(chlorine). Cdc20 is shown in magenta. Dotted 
blue lines indicate hydrogen bonds. b, View is 
rotated to show the position of V200 at the base of 
the hydrophobic binding pocket. c, Mutation of 
residues in the binding pocket reduces Cdc20 
binding to apcin-A resin (red bars) and the capacity 
of in vitro-translated Cdc20 protein to rescue 
cyclin-luciferase degradation in mitotic Xenopus 
egg extract immunodepleted of Cdc20 (blue bars). 
Bars, mean = s.e.m. of three independent 
experiments. P values were calculated by an 
unpaired t-test compared with WT. 
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(Fig. 3b and Extended Data Figs 4c and 5a), suggesting that apcin is highly 
specific for the D-box binding site. In contrast, TAME inhibited the 
degradation of all APC/C substrates tested (Fig. 3b and Extended Data 
Figs 4c and 5a, b), consistent with its ability to block recruitment of Cdc20 
to the APC/C directly’*'®. The ability of apcin to inhibit formation of 
high-molecular-mass substrate—ubiquitin conjugates closely correlated 
with its ability to stabilize each substrate (Extended Data Fig. 6). These 
results suggest that the leucine-binding pocket of Cdc20 plays an es- 
sential role in recruiting the D-boxes of securin and cycB1-NT, and that 
apcin can effectively compete with these interactions. The ability of full- 
length cyclin B1 and cyclin A2 to bind Cks1 via Cdk1 may facilitate their 
D-box-independent recruitment to the APC/C”*”, helping these sub- 
strates partially overcome the effects of apcin. Furthermore, the N- 
terminal region of cyclin A2 appears to bind Cdc20 with higher affinity 
than cyclin B1 (ref. 26), which may further reduce the effectiveness 
of apcin. In contrast to these substrates whose degradation is D-box- 
dependent, Nek2A is recruited directly to the APC/C via a Cdc20- 
independent mechanism that requires its Met-Arg (MR)-tail rather than 
a D-box””, explaining why apcin fails to inhibit ubiquitylation or deg- 
radation of this protein. 
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Because apcin and TAME inhibit APC/C-dependent proteolysis by 
distinct mechanisms, we tested the effect of combining the inhibitors 
on the proteolysis of APC/C substrates in Xenopus extract. The com- 
bination of apcin and TAME led to synergistic stabilization of cyclin B1, 
cycB1-NT, securin and cyclin A2, with a much weaker effect for Nek2A 
(Fig. 3c and Extended Data Figs 4d and 5a, b). For example, combining 
TAME and apcin at 25 UM each was more effective at stabilizing cyclin 
B1 than using either compound alone at 100 UM. Apcin slightly enhanced 
the ability of TAME to stabilize Nek2A (Fig. 3c), suggesting that the 
leucine-binding pocket of Cdc20 may bind Nek2A, even though this 
interaction is not essential for proteolysis if the APC/C is not otherwise 
perturbed. 

We next examined the effect of apcin, proTAME” (a cell-permeable 
TAME prodrug), and the combination, on mitotic exit in four different 
human cell lines. Apcin and proTAME synergized to increase the mitotic 
fraction in all cell lines examined (Fig. 4a and Extended Data Fig. 7a, b). 
Apcin-M was inactive whereas apcin-P retained activity (Extended Data 
Fig. 7c), consistent with effects on Cdc20 binding and APC/C-dependent 
proteolysis in Xenopus extract. In live-cell imaging experiments in RPE1 
cells (Fig. 4b, Extended Data Fig. 8a—d and Supplementary Videos 1 and 2), 
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Figure 4| Apcin synergizes with proTAME to prolong mitotic duration. 

a, RPE1 cells were treated with indicated concentrations of apcin and 
proTAME for 18 h, fixed and the mitotic index determined by automated high- 
throughput imaging. The panel shows the difference between the mitotic 
indices calculated by a Bliss-independence model compared with a synergy 
model; any positive value indicates synergy. *P < 0.05 on the basis of analysis of 
four technical replicates. b, Asynchronous RPE1 cells expressing H2B-green 
fluorescent protein (GFP) were transfected with short interfering RNA (siRNA) 
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and 24h later treated with apcin or apcin-M (25 uM) and/or proTAME (6 1M). 
Cells were then imaged every 6 min for 45h. Mitotic duration and cell fate 
(Extended Data Fig. 8b) were determined by manual inspection of the videos 
and plotted as inverse cumulative frequency (—proTAME) or Kaplan-Meier 
curves (+proTAME). The hatch marks on the Kaplan-Meier curves indicate 
censored cells that did not exit mitosis or die in mitosis before the end of the 
movie or before they migrated out of the field of view. Graphs include the 
combined results of two independent experiments. 
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25 LM apcin had no detectable effect on mitotic duration (P = 0.279) 
unless mitosis was artificially shortened by depletion of the spindle as- 
sembly checkpoint (SAC) protein Mad2 (P = 0.0001). In contrast, in 
the presence of proTAME, the addition of apcin dramatically slowed the 
rate of mitotic exit in a synergistic manner (Fig. 4b and Extended Data 
Fig. 8a—d): the rate of mitotic exit was 63% of that predicted by a mul- 
tiplicative combination of the single compound effects (P = 0.016). Sig- 
nificant synergy was also observed in U2OS cells (P = 2.0 X 10 *; Extended 
Data Fig. 8e). Addition of the inactive derivative apcin-M had no effect 
on the rate of mitotic exit in the presence of proTAME (P = 0.68; Fig. 4b 
and Extended Data Fig. 8a—e). The response of cells to proT AME alone 
was biphasic, because prolongation of metaphase can cause cohesion 
fatigue*’, which reactivates the SAC to block mitotic exit in a subpo- 
pulation of cells. Notably, the addition of apcin eliminated the biphasic 
response. Furthermore, the combined effect of apcin and proTAME was 
largely preserved when the SAC was inactivated by Mad2 depletion. 
When modelled quantitatively, the degree of synergy between apcin 
and proTAME was in fact enhanced in the absence of Mad2, as the rate 
of mitotic exit was reduced to 38% of the rate predicted by a multiplic- 
ative combination of the single compound effects (P = 8.92 x 10>; Ex- 
tended Data Fig. 8d). Together these findings suggest that synergistic 
inhibition of mitotic exit does not rely on the SAC, but instead prob- 
ably arises from direct pharmacological APC/C inhibition. 

The discovery of apcin and the elucidation of its mechanism validate 
the importance of the leucine-binding pocket in Cdc20 as a D-box co- 
receptor. Apcin preferentially stabilizes APC/C substrates whose degra- 
dation is D-box dependent. However, on its own, low concentrations of 
apcin do not effectively block mitotic exit, either because substrates can 
outcompete apcin binding to the leucine-binding pocket, or because 
substrates can be recruited to the APC/C through other mechanisms. 
The effectiveness of apcin can be dramatically enhanced by the addi- 
tion of TAME, which blocks Cdc20 loading through a distinct mech- 
anism, highlighting the importance of multiple weak protein-protein 
interactions in promoting activator binding and efficient substrate ubi- 
quitylation (Extended Data Fig. 9). Our work highlights the possibility 
of disrupting the function of a protein machine by simultaneously in- 
hibiting multiple protein-protein interactions. Because dynamic pro- 
tein complexes regulate virtually all aspects of cell biology, simultaneous 
targeting of multiple weak interactions may represent a new opportun- 
ity for therapeutic targeting of protein complexes that may otherwise be 
difficult to inhibit with a single compound. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Reagents. Commercial antibodies used for western analysis were as follows: anti- 
Cdc27 (610455, BD Transduction Laboratories), anti-Cdc20 (BA8; sc-93399, Santa 
Cruz Biotechnology and NB 100-2646, Novus Biologicals) to recognize Xenopus 
Cdc20, anti-Cdc20 (H-175; sc-8358, Santa Cruz Biotechnology) to recognize human 
Cdc20, and anti-HA-peroxidase (3F 10; 12013819001, Roche). Secondary antibodies 
used included anti-rabbit IgG-HRP (NA934; GE Healthcare) and anti-mouse IgG- 
HRP (NA931; GE Healthcare). For APC/C immunopurification from Xenopus 
extract, anti-Cdc27 (AF3.1; sc-9972, Santa Cruz Biotechnology) was used. For immu- 
nodepletion of Xenopus Cdc20, a rabbit polyclonal antibody was generated by Yenzym 
by immunization with an N-terminal fragment of Cdc20 (residues 1-170; tagged at 
the carboxy (C) terminus). Chemicals used were cycloheximide (239764, Calbiochem), 
calcium ionophore A23187, free acid form (100105, Calbiochem), tosyl-L-arginine 
methyl ester (T4626, Sigma), proTAME (I-440, Boston Biochem), MG262 (I-120, 
Boston Biochem), apcin (T0506-3874, Enamine), apcin-P (Amb2237944, Ambinter) 
and apcin-M (Amb1395012, Ambinter). Apcin and apcin-A were also synthesized 
by Sundia Meditech according to the methods described in the Supplementary 
Information. 

Assessment of substrate degradation in Xenopus egg extract. Use of female 
Xenopus laevis to produce cytoplasmic egg extracts was approved by the Harvard 
Medical School Standing Committee on Animals (protocol number 03231). Inter- 
phase Xenopus egg extract was prepared from eggs laid overnight according to the 
protocol of ref. 31 with the exception that eggs were activated with 2 1g ml! cal- 
cium ionophore (A23187) for 30 min before the crushing spin. Extract was frozen 
in liquid nitrogen and stored at —80 °C. Interphase extract was induced to enter 
mitosis by addition of non-degradable cyclin B (MBP-A90) at 20 pg ml! and in- 
cubated at 22-24 °C for 30-60 min. MBP-A90 consists of a fusion of the maltose- 
binding protein (MBP) to Xenopus cyclin B1 lacking its N-terminal 90 amino acids” 
and was expressed in Escherichia coli by inducing cultures at an attenuance (D600 nm) 
of 0.6 with 300 UM isopropylthiogalactoside (IPTG) for 5 h at 22-24 °C. Purification 
followed New England BioLabs protocol. To promote degradation in interphase 
extract, Cdh1 protein, expressed in baculovirus, was added to extract at a final con- 
centration of 50 nM in the presence of 75 |M roscovitine. Roscovitine addition is 
necessary to suppress inhibitory phosphorylation of Cdh1 by Cdk1. Extract was then 
pre-treated with drug (dimethylsulphoxide (DMSO), apcin and/or TAME) for 15 min 
at 22°C before addition of substrates. 

Substrates consisted of human full-length cyclin B1, cyclin A2, securin, Nek2A 
or an N-terminal fragment (residues 1-88) of human cyclin B1 (cyc B1-NT). Each 
substrate was amplified with primers by PCR to allow T7-dependent transcription 
of the PCR product. Substrates were expressed and labelled with [*°S]methionine 
(Perkin Elmer NEG709A500UC) using the TNT system (Promega). To measure 
degradation of substrates, extract was pre-treated with DMSO or test compounds 
for 10 min in the presence of 100 1g ml! cycloheximide to prevent re-incorpora- 
tion of free labelled amino acid. The in vitro translation reaction was then added to 
the Xenopus extract at 10% final volume. Extract was then incubated at 24 °C, with 
shaking at 1,250r.p.m., with samples taken at indicated times. Reactions were 
quenched with sodium dodecyl sulphate (SDS) sample buffer and processed for 
SDS-polyacrylamide gel electrophoresis (PAGE) and phosphor imaging (Bio-Rad 
PMI); quantification was performed using Quantity One software (Bio-Rad). 
Assessment of substrate ubiquitylation in mitotic Xenopus egg extract. Inter- 
phase Xenopus egg extract was supplemented with MBP-A90 to promote entry into 
mitosis. Mitotic extract was then treated for 30 min at 24 °C with 20 1M ubiquitin- 
vinyl sulfone (UbVS; U-202, Boston Biochem) to suppress deubiquitylation and 
with 100 pg ml“! cycloheximide to prevent re-incorporation of free labelled amino 
acid. Subsequently, mixtures containing apcin or DMSO, as indicated, proteasome 
inhibitor MG262 (150 11M) and wild-type ubiquitin (44 1M) were added to UbVS- 
treated extract and incubated for additional 10 min, at 24 °C with agitation. In vitro 
translation reactions expressing human full length cyclin B1, cyclin A2, securin, Nek2A, 
or cycB1-NT with [*°S]methionine labelling were also pre-treated with UbVS at 
20 uM and added to pre-treated extract at 10% final volume. Extract was then in- 
cubated at 24 °C, with shaking at 1250 r.p.m., and substrate ubiquitylation mon- 
itored by taking samples at indicated times. Reactions were quenched with SDS 
sample buffer and processed for SDS-PAGE and phosphor imaging (Bio-Rad PMI), 
and quantification was performed using Quantity One software (Bio-Rad). 
Measurement of the apcin-Cdc20 interaction in Xenopus egg extract using 
cellular thermal shift assay. The cellular thermal shift assay (CETSA) method’ 
was adapted to examine apcin engagement of endogenous Cdc20 in Xenopus egg 
extract. Interphase extract was diluted tenfold and incubated with various concen- 
trations of apcin dissolved in DMSO in a total volume of 200 tl, with a final DMSO 
concentration of 1%. After 10-min incubation at 22-24 °C, 50 ul of the lysate was 
transferred into PCR tubes (20170-012, VWR) and heated in a PCR machine (Mas- 
tercycler gradient, Eppendorf) for 3 min at 46 °C, followed by cooling for 3 min 
at 22-24 °C. These conditions were established in preliminary experiments as the 


temperature that yielded the greatest degree of Cdc20 precipitation that could be 
rescued by apcin treatment (data not shown). The heated lysates were centrifuged 
at 14,000 r.p.m. (20,000g) for 20 min at 4 °C to separate the soluble fractions from 
precipitates. Twenty microlitres of the supernatants were mixed with SDS sample 
buffer and the fraction of soluble Cdc20 was analysed by SDS-PAGE and anti-Cdc20 
western blot. Quantification of soluble Cdc20 used Fuji Imager LAS3000 and Image] 
software. Soluble Cdc20 levels were normalized to soluble Cdc20 in samples treated 
with the highest concentration of apcin. 

Measurement of Cdc20 binding to APC/C in Xenopus egg extract. To examine 
levels of Cdc20 associated with APC/C, the APC/C was immunopurified from mito- 
tic Xenopus egg extract. For 100 pil extract, 2 ug of anti-Cdc27 antibody (AF3.1, 
Santa Cruz Biotechnology) was cross-linked to 5 ul of Affiprep Protein A beads 
(156-0006, Bio-Rad) and incubated for 1h at 4°C. Apcin, TAME or DMSO was 
mixed with extract upon addition to anti-Cdc27-Affiprep Protein A beads in the 
presence or absence of cycB1-NT containing a HA-tag at the N terminus and His 
tag at the C terminus, as previously described™. After incubation with extract, beads 
were washed quickly three times with 20-fold volume of XB (10 mM potassium 
HEPES, pH 7.7, 100 mM KCl, 0.1 mM CaCl, 1 mM MgCl,) and combined with 
SDS sample buffer. For analysis of Cdc20 binding to APC/C, samples were pro- 
cessed for SDS-PAGE and immunoblotting against Cdc20 and APC/C subunit 
Cdc27. Chemiluminescence was imaged on a Fuji LAS 3000 with Image Reader 
LAS-3000 software. Levels of Cdc20 were quantified using ImageJ and data nor- 
malized to respective Cdc27 levels. 

Coupling of apcin-A to affigel-10 resin. Affigel-10 resin (153-6099, Bio-Rad) was 
washed twice with DMSO and dried. The resin was then mixed with 5 mM or 15 mM 
apcin-A dissolved in DMSO (2X volume of dry resin). N,N-diisopropylethylamine 
was diluted 50-fold into the solution. The resin was rotated at 22-24 °C for 2h and 
the reaction was quenched with 1/5 resin volume of ethanolamine. The resin was 
then washed sequentially with isopropanol, water and XB + 0.05% Tween. The 
resin was stored at 4°C as a 50% slurry in XB + 0.05% Tween. 

Cdc20 depletion by apcin-A resin. For a round of depletion of Cdc20 from mito- 
tic Xenopus extract, apcin-A resin (15 mM coupling) was incubated with extract at 
4°C rotating for 30 min. The volume of resin used was 40% of the extract volume. 
Three rounds of depletion were performed. To rescue degradation in the depleted 
extract, reticulocyte lysate containing in vitro-translated human Cdc20 or control 
reticulocyte lysate, treated as for a translation reaction but with no DNA template, 
was added to the extract at 1/10th extract volume. 

Assay for Cdc20 binding to apcin-A resin. Human Cdc20 in pCS2 vector was 
mutated at the various residues described with the QuikChange Site-Directed Muta- 
genesis Kit (Agilent Technologies) and custom primers for each sequence. All mu- 
tations were confirmed by sequencing. For a single pull-down assay, 5 pil of apcin-A 
resin (5 mM coupling) was incubated with 30 il diluted in vitro-translated human 
Cdc20 or other WD40 proteins (5 pl reticulocyte lysate diluted to 30 pil with XB + 
0.05% Tween) at 24°C with shaking (1500 r.p.m.) for 30 min. Competitors were 
pre-incubated with Cdc20 in reticulocyte lysate for 2 min at 22-24 °C before add- 
ing to apcin-A resin. Bound and input Cdc20 were detected by western blotting 
(Fig. le, f). Cdc20, other WD40 proteins, ODC and Cdc20 mutants in Fig. 2c and 
Extended Data Figs 1b, c and 3d were labelled with [*°S]methionine and detected 
by phosphorimaging. 

Assessment of kinetics of APC/C-dependent ubiquitylation in a reconstituted 
system. Measurements of kinetics of ubiquitylation used cycB1-NT and APC/C 
isolated from Xenopus extract, exactly as previously described"*, in the presence or 
absence of 50 uM apcin. The K; was calculated on the basis of the assumption of 
a competitive inhibition model according to the equation Ky, apcin = Km, untreated 
(1 + [Z]/K;), where [I] = 50 UM Ky, apcin (773 NM) and Ky, untreated (245 nM) were 
the average values from three independent experiments. The P value was calcu- 
lated by an unpaired t-test. 

Antibody-based depletion of Cdc20 from Xenopus extract. Cdc20 antibody, co- 
valently coupled to Affiprep protein-A beads as described’’, was incubated with 
mitotic extract at 4 °C with rotation for 30 min. The volume of antibody beads used 
was 20% of the reaction volume. Three rounds of depletion were performed, with 
separation of extract from beads after each round by centrifugation in spin columns 
(89868, Thermo-Pierce). 

Luciferase assay. A fusion of the N-terminal domain of cyclin B1 to luciferase’” was 
added to mitotic extract at 4 jig ml’ (Fig. 1d) or to interphase extract at 250 ug ml" 
for 10 min at 22-24 °C then diluted toa final concentration of 4 1g ml’ in mitotic 
extract (Fig. 2c). The extract was incubated at 22-24 °C and 3 1] samples were taken 
at 0, 20, 40 and 60 min. The samples were mixed quickly with 30 1] luciferin assay 
buffer (270 1M coenzyme A, 20 mM tricine, 3.67 mM MgSO,, 0.1 mM ethyledia- 
minetetraacetic acid (EDTA), 33.3 mM dithiothreitol (DTT), 530 1M ATP and 
470 «tM luciferin, pH 7.8), and the level of luminescence was measured on a Wallac 
1420 multilabel counter. 
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Protein purification and crystallization. The coding region of human Cdc20 con- 
taining residues 161-477 (Cdc20-WD40) was amplified by PCR and cloned into 
the modified pFastBac vector. Recombinant baculovirus encoding the N-terminal 
His6-tagged Cdc20 protein was constructed using the Bac-to-Bac system (Invitro- 
gen) according to the manufacturer’s protocols. A tobacco etch virus protease cleav- 
age site was introduced into the N terminus of Cdc20. Sf9 insect cells were infected 
with the Cdc20 baculovirus and harvested at about 60 h post-infection. His6-tagged 
Cdc20-WD40 was purified with Ni°*-NTA agarose resin (Qiagen) and cleaved 
with tobacco etch virus protease to remove the Hisg-tag. The Cdc20-WD40 pro- 
tein was further purified by anion exchange chromatography with a Mono-Q col- 
umn followed by size exclusion chromatography with a Superdex 200 column (GE 
Healthcare). Purified Cdc20-WD40 was concentrated to 4-5 mg ml‘ in the Super- 
dex 200 column buffer containing 25 mM Tris (pH 8.5), 150mM NaCl, 1mM 
MgCh, 5% glycerol and 5 mM tris(2-carboxyethyl)phosphine. 

The Cdc20-WD40 protein was crystallized at 20 °C using the sitting-drop vapour- 
diffusion method with a reservoir solution containing 0.1 M 2-(N-morpholino)ethane- 
sulfonic acid (MES, pH 6.5), 15% (w/v) polyethylene glycol (PEG) 6000 and 5% 
MPD. The Cdc20-WD40 crystals were transferred to a new 2 ul sitting drop witha 
reservoir solution containing 0.1 M MES (pH 6.5) and 20% (w/v) PEG 6000 and 
soaked for 3-5 h to remove bound MPD. Apcin compound was dissolved into DMSO 
and added to the same drop at the final concentration of 5 mM. After overnight 
soaking, the crystals were cryo-protected in a solution containing 0.1 M MES (pH 6.5), 
20% (w/v) PEG 6000, 10% glycerol and 5 mM apcin, then flash-cooled in liquid 
nitrogen. Crystals diffracted to a minimum Bragg spacing (dmin) of about 2.1 A and 
exhibited the symmetry of space group P2, with cell dimensions of a = 41 A, b= 
87 A,c=48A and B = 113°, and contained one Cdc20 molecule per asymmetric 
unit. 

Data collection and structure determination. Diffraction data were collected at 
beamline 19-ID (SBC-CAT) at the Advanced Photon Source (Argonne National 
Laboratory) and processed with HKL3000 (ref. 35). Phases were obtained by mole- 
cular replacement with Phaser using the crystal structure of human Cdc20-WD40 
(Protein Data Bank accession number 4GGC) as search model”. Iterative model 
building and refinements used COOT and Phenix, respectively*’**. The final model 
for Cde20-WD40-apcin (Rwork = 16.5%, Ree = 21.3%) contained 313 residues, 82 
water molecules and one apcin molecule. MolProbity was used for structure valid- 
ation to show that all models had good geometry, except for one surface residue that 
was an outlier in a Ramachandran plot”. Data collection and structure refinement 
statistics are summarized in Extended Data Table 1. 

High-throughput image-based assay to measure mitotic fraction. Parental A549, 
U2O0S and hTERT-RPE] cells were purchased from ATCC. DLD-1 cells were pur- 
chased from Sigma. For hTERT-RPE1, A549 and U20S cells, stable cell lines expres- 
sing H2B-GFP were derived using described methods” and used in the experiments. 
DLD-1 cells were used without further modification. Cell lines were tested for my- 
coplasma contamination (Lonza kit LT07-218) after they were derived and were 
found negative. For each cell line, asynchronous cells were re-suspended to a den- 
sity of 3.75 X 10° cells per millilitre. A WellMate dispenser (Thermo Scientific) was 
used to distribute 40 jl of suspension to each well of a black, clear-bottom 384-well 
plate (3712, Corning). Plates were sealed with breathable white rayon sealing tape 
(241205, Nunc) during plating and subsequent incubation. After 24h incubation, 
the cells were treated with indicated concentrations of apcin and proTAME dis- 
solved in DMSO, in four technical replicates. After 18 h, cells were fixed and stained 
directly without wash steps to avoid loss of mitotic cells, by adding 10 pl of 6X con- 
centrated fixing/staining reagent (60% formalin, 0.6% Triton X-100 and 1.5 pg ml 
Hoechst 33342 in DPBS). The plates were sealed with aluminium sealing tape (276014, 
Nunc) and incubated at 22-24 °C for 40 min before imaging. Plates were then imaged 
at four positions per well using an ImageXpress Micro (Molecular Devices) high- 
throughput microscope, with a X10 objective, yielding a total of 16 images per 
condition (four images X four replicates). Cell images were processed automat- 
ically in Image] to identify the nuclei, count the number of nuclei and determine 
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the maximum fluorescence intensity of each nucleus in each image. The output 
files from ImageJ for each treatment were pooled and the cumulative frequency 
curves of maximum intensity for the cell population in each treatment were com- 
puted using Matlab. An intensity threshold was set on the basis of the mitotic frac- 
tion in the wells treated with DMSO to separate mitotic cells from interphase cells. 
The interphase fraction for each treatment was indicated by the fraction below the 
threshold on the cumulative frequency plot. Methods for statistically analysing the 
data are presented in Supplementary Information. 

Fluorescence live-cell imaging. For experiments with RPE1-H2B-GFP cells, asyn- 
chronous RPE1-H2B-GFP cells were plated in 24-well glass-bottom plates (Greiner 
BioOne, 662892) 18-24 h before siRNA transfection using RNAiMax (Invitrogen). 
Cells were transfected with Mad 2 siRNA (GGAACAACUGAAAGAUUGGAdTdT, 
synthesized by Dharmacon) or non-targeting control siRNA (D-001210-01-20, 
Dharmacon) at a final concentration of 20 nM. Twenty-four hours after transfec- 
tion, cells were treated with compounds and imaging was initiated following com- 
pound treatment. For U2OS-H2B-GFP cells, the cells were first synchronized by 
double thymidine block (18 h first block, 8 h release, 17 h second block; thymidine 
concentration 2 mM). Compounds were added at 7 h after release from the second 
block, and imaging was initiated after compound treatment. To measure efficiency 
of Mad2 knockdown by siRNA, western blot samples were each prepared from a 
single well of the 24-well glass-bottom plates. Twenty-four hours after transfection, 
cells were collected by trypsinization, pelleting and re-suspension in 2X NuPAGE 
sample buffer (Invitrogen) + 50mM DTT. 

For imaging, plates were inserted into a covered chamber supplied with humidi- 
fied 5% CO, and mounted onto a motorized microscope stage (Prior Scientific). 
Differential interference contrast and fluorescence images were captured at 6 min 
intervals for 45 h using a Nikon Ti inverted fluorescence microscope fitted with a 
37 °C enclosed incubation chamber and using a X20 Plan Apo 0.75 numerical aper- 
ture objective lens. A Hamamatsu ORCA cooled CCD (charge-coupled device) cam- 
era collected the images with 2 X 2 binning using Nikon Elements software (version 
3.0). Videos were manually analysed using Nikon Elements software or Image]. 
Mitotic duration was defined as the time from nuclear envelope breakdown until 
anaphase, in the case of normal mitosis, or until exit from prolonged mitosis as in- 
dicated by cytoplasmic blebbing accompanied by changes in chromatin as detected 
by H2B-GFP. Cell fate was scored as ‘division’ if two daughter cells were produced 
by mitosis of any duration, and ‘abnormal exit’ ifa single cell of interphase appear- 
ance resulted after that cell was in a mitotic state of any duration. The ‘death’ fate 
describes cells that entered mitosis and died while in mitosis. Methods for statist- 
ically analysing live cell imaging data are presented in Supplementary Information. 
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Extended Data Figure 1 | Apcin binds Cdc20 and inhibits APC/C- 
dependent proteolysis in Xenopus extract. a, Effects of apcin and derivatives 
(200 1M) on degradation of a cyclin B1 N-terminal fragment (cycB1-NT) in 
mitotic Xenopus egg extracts. Substrate was expressed in reticulocyte lysate 
and labelled with [°°S]methionine. Samples were analysed by SDS-gel 
electrophoresis and phosphorimaging. Quantitation of the 40-min time point 
from three independent experiments is shown in Fig. 1b. b, Apcin-M and 
TAME do not inhibit binding of Cdc20 to apcin-A resin. The experiment was 
performed as shown in Fig. le except that the inactive apcin derivative apcin-M 
or the Cdc20-IR tail antagonist TAME were also tested and [?°S]Cdc20 was 
detected by autoradiography. c, Apcin-A interacts strongly with Cdc20, 
weakly with Cdh1, and shows little interaction with other WD40-domain 
proteins. Proteins were expressed in reticulocyte lysate and labelled with 
[°°S]methionine. Left panel, mean value for the percentage bound on the basis 
of three experiments (error bars, s.e.m.). Right panel, representative 
autoradiograph of one of three experiments. b, bound; i, 5% input; c, bound to 


empty resin (control). d, Apcin inhibits degradation of cycB1-NT in Cdh1- 
treated interphase extract. The addition of roscovitine (75 1M) is required to 
inhibit Cdk1 activity that suppresses Cdh1-dependent proteolysis. Note that 
100 UM apcin is less effective at stabilizing the substrate in Cdh1-activated 
interphase extract than in mitotic Xenopus extract (Fig. 3b). e, Apcin binds to 
endogenous Cdc20 in Xenopus extract as measured by a thermal shift assay”. 
Extract was incubated with varying concentrations of apcin, heated to 46 °C for 
3 minutes, and precipitated proteins removed by centrifugation. The soluble 
fraction was analysed by SDS-PAGE and western blotting for Cdc20. The left 
panel shows the percentage of soluble Cdc20 (mean + s.e.m. from three 
independent experiments). Western blot from one of three experiments is 
shown below. For comparison, total Cdc20 from interphase extract (and 
various dilutions) is shown. Right panel, Coomassie-stained gel of soluble 
proteins, indicating that there is not an observable non-specific stabilization 
of proteins induced by apcin addition. 
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Extended Data Figure 2 | Apcin acts as a competitive inhibitor of APC/C- 
dependent ubiquitylation. a, Apcin inhibits formation of high-molecular- 
mass ubiquitin conjugates of full-length cyclin B1. Mitotic Xenopus extract was 
pre-treated with the deubiquitinating enzyme inhibitor ubiquitin vinyl sulfone 
(UbVS, 20 1M) and proteasome inhibitor MG262 (150 1M) to stabilize 
ubiquitin conjugates. [°°S]cyclin B1 was added together with ubiquitin (44 1M) 
and samples analysed by SDS-PAGE and phosphorimaging. Right panel shows 
quantitation of the experiment. b, Apcin acts as a competitive inhibitor of 
APC/C-dependent ubiquitylation. APC/C was purified from mitotic Xenopus 
extracts and the initial rates of ubiquitylation of HA-tagged cycB1-NT were 
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measured in the presence of methylated ubiquitin to prevent ubiquitin chain 
elongation. The reaction was stopped at 45s and the products were detected 
by anti-HA blot. The left panel shows the anti-HA blot from one experiment; 
substrate concentrations were 62.5, 125, 250, 500 and 1,000 nM (left to right). 
Asterisk indicates an SDS-resistant aggregated form of substrate. Quantitation 
of three independent experiments, and a summary of kinetic parameters, 

are shown. Note that the effects of apcin in this reconstituted assay performed 
under initial rate conditions appear distinct from those obtained in crude 
Xenopus extract. See Supplementary Discussion for a more detailed discussion 
of these differences. 
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Extended Data Figure 3 | Structure of the apcin-Cdc20 complex. a, The 
F, — F, omit electron density map at the contour level of 3¢ is shown. The 
density for the bound apcin conformation is unambiguous and is consistent 
with the structure-activity analysis and mutagenesis experiments. b, Overlay of 
the structure of the Cdc20-apcin structure with the structure of a D-box- 
containing protein bound to Cdh1 (ref. 14). The trichloromethyl group of apcin 
projects into a hydrophobic pocket that is occupied by the leucine of the RXXL 
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motif of the D-box. The position of the arginine from the RXXL motif suggests 
a role for E465 of Cdc20 in a charge-based interaction with the D-box, 
consistent with our data that the E465S mutation disrupts the ability of Cdc20 
to promote substrate degradation more than it perturbs apcin binding. 

c, Depletion with anti-Cdc20 antibody covalently coupled to protein A beads 
depletes endogenous Xenopus Cdc20 from mitotic extract. d, Example 
autoradiogram of data shown in Fig. 2b. 
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Extended Data Figure 4 | Effects of apcin on Cdc20 binding to APC/C and _ western blotting against Cdc20 and Cdc27. Levels of Cdc20 were quantitated 


stability of APC/C substrates in mitotic Xenopus extract. a, Example using Image] and normalized to APC/C subunit Cdc27. c, Analysis of effects 
western blot for data shown in Fig. 3a. b, Substrate-mediated recruitment of | of apcin and TAME on APC/C substrate degradation in mitotic Xenopus 
Cdc20 to APC/C in Xenopus extract is dependent on the D-box motif. extract. Levels of *°S-labelled substrates were assessed by SDS-PAGE and 
Increasing concentrations of wild-type or different D-box mutants of cycB1l- —_ phosphorimaging. Asterisk represents a non-specific band. Images show one 
NT were introduced into mitotically arrested Xenopus extract and the APC/C _ of three experiments quantitated in Fig. 3b. d, Experiment performed as 

was isolated with anti-Cdc27 antibodies by immunoprecipitation for 1h at in c, but examining the combined effects of apcin and TAME. Image shows 


4°C, The immunoprecipitate was separated by SDS-PAGE and analysed by one of three experiments quantitated in Fig. 3c. 
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Extended Data Figure 5 | Effects of apcin and TAME on stability of APC/C 
substrates in mitotic Xenopus extract. Apcin and TAME synergize in 
stabilizing cyclin A2 (a) and securin (b) in mitotic Xenopus extract. Levels of 
*°§-labelled substrates were assessed by SDS-PAGE and phosphorimaging. 


experiments. 
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The change in mobility of securin between 0 and 20 min is probably a result of 
mitotic phosphorylation. Error bars in a represent mean and s.e.m. of three 
independent experiments. Data in b are representative of two independent 
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Extended Data Figure 6 | Apcin has differential effects on substrate 
ubiquitylation in mitotic Xenopus extract that correlate with effects on 
proteolysis. To examine the profile of ubiquitylated species generated in 
Xenopus extract, deubiquitylation and proteasome-mediated degradation were 
inhibited by pre-treatment with the general deubiquitinating-enzyme inhibitor 
ubiquitin-vinyl sulfone (UbVS; 20 11M), as established previously*’, and the 
proteasome-inhibitor MG262 (150 LM). Next, wild-type ubiquitin (Ub; 44 1M) 
and apcin (100 1M) or DMSO were added. *°S-labelled APC/C substrates 
expressed in reticulocyte lysate were introduced into treated extract and their 
ubiquitylation at indicated times assessed by SDS-PAGE and phosphor 
imaging. Levels of substrates that were not modified with ubiquitin 
(unconjugated), modified with one to three ubiquitins (substrate-Ub, Ub = 3) 


Time (min) 


or modified with more than three ubiquitin moieties (substrate-Ub, Ub > 3) 
were quantitated by phosphorimaging and plotted relative to radiolabelled 
protein in the respective region of the gel at 0 min in the apcin or DMSO 
sample. a, cycB1-NT; b, cyclin B1; ¢ Nek2A; d, cyclin A2; e, securin. Note that 
for cyclin A2, apcin reduces the amount of Ub conjugates with very high 
molecular mass (as indicated by inspection of the gel image) but does not 
reduce the fraction of conjugates modified with more than three ubiquitins (as 
indicated by the quantitation). These results are consistent with the inability of 
apcin to stabilize cyclin A2 in proteolysis assays, given that the proteasome 
typically requires at least four ubiquitin molecules to be attached to a 
substrate for efficient proteolysis. 
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Extended Data Figure 7 | Apcin and proTAME synergize to block mitotic 
exit in human cells, as measured in a fixed cell assay. a, Summary of the fixed- 
cell imaging assay and data processing methods to determine synergy between 
apcin and proTAME. See methods for detailed description of the assay. b, First 
column: primary data plotted as a heat map displaying mean fraction below 
threshold for each drug treatment concentration in each of four cell lines. This 
threshold is established on the basis of the mitotic index of DMSO-treated cells. 
Note that a high value in this column indicates a low mitotic index. Effects of 
single drugs alone are highlighted in pale purple. Second column (labelled 
‘Synergy model’): calculated effect of the combination of drugs on the basis ofa 
model that permits synergistic interaction between proTAME and apcin. 
Because this panel shows calculated values of combination effects, the effects of 
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individual drugs are not shown. Note that the synergy model closely parallels 
the actual data shown in the first column. Third column: calculated effect of the 
combination of drugs on the basis of a model that permits only multiplicative 
interaction between proTAME and apcin (Bliss model). Because this panel 
shows calculated values of combination effects, the effects of individual drugs 
are not shown. Note that the Bliss model does not closely parallel the actual 
data shown in the first column. Fourth column: heat map of the difference 
between the synergy and Bliss model predictions shows the degree of synergy at 
each drug dose combination (same as Fig. 4a for RPE] cells). *P < 0.05 for 
analysis of four technical replicates. See Supplementary Information for details 
of the statistical analysis. c, Activity of apcin derivatives in the fixed cell assay 
described in a, using RPE1 cells. 
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Extended Data Figure 8 | Apcin and proTAME synergize to block mitotic 
exit in human cells, as measured in a live cell assay. a, Western blot of Mad2 
knockdown in RPE] cells by siRNA from one of the experiments shown in 
Fig. 4b. b, Analysis of cell fate in RPE] cells for the experiment shown in Fig. 4b. 
c, Combined data from two independent experiments in RPE1 cells shown in 
Fig. 4b. ‘Cells’ is the total number of cells analysed, while “Fates observed’ is the 
subset of cells whose fate was observed, excluding cells that migrated out of 
view during the movie or were still arrested at the end of imaging. The median is 
the time on the x axis of the Kaplan-Meier curve corresponding to 0.5 on the 
y axis ‘Fraction of cells in mitosis’. d, Statistical modelling of data from 
experiment in Fig. 4b (RPE1 cells). The two tables on the left show the rate of 
mitotic exit relative to DMSO control for each of two siRNA treatment 
subgroups (control siRNA, left; Mad2 siRNA, middle), the P value for the 
comparison to DMSO (Cox proportional hazards model, see Supplementary 
Methods) and the P value for the comparison of proTAME with or without 
apcin-M. To determine P values for the pairwise comparisons of proTAME 
versus apcin-M with proTAME, we fitted a Cox proportional hazards model 
similar to that described in the Methods but with just an apcin-M effect and 
analysed just the subset of cells that were treated with either proTAME only or 


apcin-M with pro AME. The table labelled ‘Synergy between apcin and 
proTAME shows the rate of mitotic exit with both compounds (apcin with 
proTAME), relative to what would be predicted by a multiplicative 
combination of the effects of each compound alone. The magnitude of the 
synergy is roughly doubled when checkpoint activity is reduced by Mad2 
siRNA. e, Synchronized U2OS H2B-GFP cells were treated with apcin or 
apcin-M (25 uM) and/or proTAME (12 |1M). Cells were then imaged every 6 or 
10 min for 45h. Mitotic duration and cell fate were determined by manual 
inspection of the videos and plotted as Kaplan-Meier curves. The hatch marks 
on the Kaplan-Meier curves indicate mitotic duration endpoints of censored 
cells. Graphs include the combined results of five independent experiments. 
The U2OS model differs from the model used to test RPE1 data in that the 
U20S analysis is not stratified by either date or person, and the U2OS data 
do not include an effect of apcin-M alone (in the absence of proTAME). 
Pairwise comparison between proTAME and proTAME with apcin-M was 
tested using a Cox proportional hazards model stratified by date, using 

data from only the experimental blocks in which both proTAME and 
proTAME with apcin were tested. 
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Extended Data Figure 9 | Model of the effects of apcin and TAME on 
formation of the APC/C-Cdc20-substrate ternary complex. a, Schematic 
drawing of core APC/C subunits. Not all subunits are indicated, and not all 
known interactions between subunits are illustrated for the sake of simplicity. 
The light blue oval and polygon indicate binding sites for Cdc20 on core 
APC/C subunits. b, In the absence of substrate, Cdc20 (green) can bind to the 
APC/C via the C-box (labelled ‘C’), which interacts with APC8, and the IR-tail 
(labelled ‘IR’), which interacts with APC3 (Cdc27). c, Binding of substrates 
(red) that contain a D-box (labelled ‘D’) can promote formation of a 
co-receptor interaction between the WD-40 domain of Cdc20 and Apcl0. 


APC7 


APC3 


Dp PAPC10 


APC4,5 


IR 
Apcs WD-40 


APC8 ie; 


APC7 


APC3 
IR 


APC6 WD-40< D J APC10 


APC8 Cc Ub) apc 


APC4,5 


cl 
7 Te) 
J Ay Asy 
H oH 
apcin (A) 
IR 


Apce WD-40 


APC8 Cc 


APC7 


APC3 


APC10 


APC11 APC11 


APC4,5 


d, The RING-containing subunit APC11 can recruit the E2 enzyme to 
conjugate ubiquitin to the substrate. e, TAME binds APC3 to interfere with the 
IR-tail binding site. f, Apcin binds to the leucine pocket of the WD-40 domain 
of Cdc20. g, In the presence of TAME (labelled “T’), the IR-binding site is 
disrupted, but Cdc20 can still be recruited to the APC/C through the C-box 
interaction and co-receptor interaction. h, Apcin (labelled ‘A’) can disrupt the 
D-box interaction between the substrate and Cdc20, but Cdc20 can still interact 
through the C-box and IR-tail interactions. i, Combined use of apcin and 
TAME disrupts both interactions, cooperatively disrupting the interaction 
between APC/C, Cdc20 and substrate. 
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Extended Data Table 1 | Data collection and refinement statistics for Cdc20-apcin structure 


Data collection 


Space group 
Cell dimensions 
a,b,c (A) 
B(*) 
Resolution range (A) 
Rmerge (%) 
Wo(l) 
Data completeness (%) 
Redundancy 
Energy (eV) 
Unique reflections 
Wilson B-value (A?) 


Refinement statistics 


P2, 


41.04, 87.18, 48.04 
112.72 
40.00 — 2.10 (2.13 — 2.10) 
8.20 (38.9) 
13.7 (1.8) 
92.8 (58.5) 
3.5 (1.6) 
12,684.1 
16,859 (522) 
27.8 


Resolution range (A) 
No. of reflections Ryork/Rrree 


Data completeness (%) 


Atoms (non-H protein/solvent/inhibitor) 


Rwork (%) 

Riree (%) 

R.m.s.d. bond length (A) 
R.m.s.d. bond angle (°) 


Mean B-value (A’) (protein/solvent/inhibitor) 


Ramachandran plot (%) 
(favored/additional/disallowed)” 


Maximum likelihood coordinate error 


Missing residues, by chain 


28.58 — 2.10 (2.23 — 2.10) 
16,859/857 (1,915/100) 
92.4 (67.0) 
2,430/81/27 
16.5 (21.4) 

21.3 (29.1) 

0.002 
0.73 


29.6/30.0/51.1 


96.5/2.9/0.6 
0.20 


A: 161 — 164 


Data for the outermost shell are given in parentheses. °As defined by the validation suite MolProbity. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


1 sid al Be 8 


doi:10.1038/nature13671 


Transcriptional interference by antisense RNA is 
required for circadian clock function 


Zhihong Xue’, Qiaohong Ye', Simon R. Anson’, Jichen Yang’, Guanghua Xiao’, David Kowbel*, N. Louise Glass‘, 


Susan K. Crosthwaite* & Yi Liu! 


Eukaryotic circadian oscillators consist of negative feedback loops 
that generate endogenous rhythmicities’. Natural antisense RNAs 
are found in a wide range of eukaryotic organisms” °. Nevertheless, 
the physiological importance and mode of action of most antisense 
RNAs are not clear®’. frequency (frq) encodes a component of the 
Neurospora core circadian negative feedback loop, which was thought 
to generate sustained rhythmicity’. Transcription of qrf, the long 
non-coding frq antisense RNA, is induced by light, and its level os- 
cillates in antiphase to frq sense RNA’. Here we show that qrftran- 
scription is regulated by both light-dependent and light-independent 
mechanisms. Light-dependent qrftranscription represses frq expres- 
sion and regulates clock resetting. Light-independent qrf express- 
ion, on the other hand, is required for circadian rhythmicity. frq 
transcription also inhibits grf expression and drives the antiphasic 
rhythm of qrf transcripts. The mutual inhibition of frq and qrf 
transcription thus forms a double negative feedback loop that is in- 
terlocked with the core feedback loop. Genetic and mathematical mod- 
elling analyses indicate that such an arrangement is required for robust 
and sustained circadian rhythmicity. Moreover, our results suggest 
that antisense transcription inhibits sense expression by mediating 
chromatin modifications and premature termination of transcrip- 
tion. Taken together, our results establish antisense transcription as 
an essential feature in a circadian system and shed light on the im- 
portance and mechanism of antisense action. 

The transcription factors WHITE COLLAR (WC)-1 and -2 form a 
complex that activates frq transcription in the dark (DD) and mediates 
light-induced frq transcription for light-resetting of the clock by bind- 
ing to light-responsive elements (LREs) on the frq promoter’. 3 rapid 
amplification of cDNA ends (RACE) and RNA sequencing showed that 
frg and qrf transcripts overlap almost completely (Fig. 1a and Extended 
Fig. 1a). In the wc mutants, frq expression was nearly abolished but qrf 
transcript was observed at ~25% of wild-type levels (Fig. 1b), indicating 
that both WC-dependent and WC-independent mechanisms mediate 
qrf transcription. WC complex binds to the qrf promoter region™. frq 
constructs with point mutations in each of the five putative binding 
sites in the qrf promoter were individually introduced into a frq’° (frq 
and qrf null) strain’. Mutation of only one site, qLRE, markedly de- 
creased the qrf level (Fig. 1c and Extended Data Fig. 1b-d). In the frqr 
strain, the qrflevel was comparable to that of the we mutant. Ina qLRE 
knock-in strain (frqS"('8"™; Extended Data Fig. le), qrf levels were 
also much lower than in the control knock-in strain, and WC binding 
at the qrf promoter was completely abolished (Fig. 1c, d and Extended 
Data Fig. 1f), indicating that qLRE is the only WC binding site in the 
qrf promoter. 

Coinciding with low levels of qrf in the qLRE mutant, higher than 
wild-type levels of frqg messenger RNA and FRQ protein were observed 
(Fig. 1c and Extended Data Fig. 1d, f, g). frq and qrf are rapidly light- 
induced in the wild type (Fig. le) but in the frq’°;frq? “""™ strain, where- 
as qrf induction was completely abolished (Fig. 1f), induction of frq in 


the light was significantly elevated when compared with the wild type, 
indicating that light induction of qrf represses light-induced frq. Cir- 
cadian conidiation rhythms of qLRE mutant strains in DD were near 
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Figure 1 | WC-mediated light-induction of qrfrepresses frq to regulate light 
resetting of the clock. a, Diagram showing the frq locus. E, EcoRV site; 

C, Clal site. b, Strand-specific analyses by quantitative PCR with reverse 
transcription (RT-qPCR), showing the levels of frq and qrf in constant light 
conditions (LL). frq-RT and qrf-RT are control reactions without reverse 
transcriptase. Error bars show s.d. (n = 3). c, Northern blot analyses showing 
the expression of frq and qrf in LL. d, WC-2 ChIP assays showing the WC 
binding levels at the qLRE of grf promoter in LL. e, Strand-specific RT-qPCR 
analyses showing the levels of frq and qrf after 2 min of light induction at DD24 
in the wild type (WT). The numbers on the x axis represent the time points 
(minutes) in the dark after light induction. f, Strand-specific RT-qPCR analyses 
showing the levels of frg and qrf after 2 min of light induction at DD24 in 
the indicated strains. The numbers on the x axis represent the time points 
(minutes) in the dark after light induction. g, Phase response curves of 
conidiation rhythms after 2 min of light pulse at different circadian time (CT) 
points (hours). *P < 0.05, **P< 0.01, ***P < 0.001 (unpaired two-tailed 
t-test with Gaussian distribution assumption). 
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normal; however, a light pulse resulted in significantly greater phase 
shifts in the qLRE mutants than in the controls (Fig. 1g and Extended 
Data Fig. 1h). These results are consistent with a previous report® and 
indicate that qrf regulates light resetting of the clock by repressing light- 
induced frq. 

Similar levels of qrf transcripts seen in the frq and fr 8"™ 
strains at DD24 (24h in constant darkness) (Extended Data Fig. 2a) 
indicate that qLRE does not regulate qrf expression in DD. In a strain 
(frq’°;frq.aq) in which the promoter of qrf was replaced with the quinic 
acid (QA)-inducible qa-2 promoter, qrf expression was completely abol- 
ished in the absence of QA (Fig. 2a). On addition of QA, qrf was induced 
but frq levels were significantly decreased, further indicating repres- 
sion of frq by qrf. 

Several observations indicate that qrf expression must be within a 
certain range to permit a functional clock. Without QA, the frq'°;frq.aq 
strain showed arrhythmic conidiation after the first day (Fig. 2b and 
Extended Data Fig. 2b). Moreover, a luciferase reporter (Pfrq-luc)'* in 
the frq’°;frq.aq strain showed that the circadian luciferase activity seen 
in the control strain was abolished, indicating that grf expression is re- 
quired for clock function (Fig. 2c and Extended Data Fig. 2c, d). As QA 
concentration increased, circadian conidiation rhythms were gradually 
restored in the frq’°;frq.aq strain, and at 10 °-10 *M QA, the rhythms 
were similar to those in the control strain (Fig. 2b and Extended Data 
Fig. 2b). At higher QA concentrations, however, the amplitudes of the 
rhythms decreased (Fig. 2b) or became arrhythmic (Extended Data 
Fig. 2b). In addition, the phase of the rhythms was significantly delayed 
without QA (Extended Data Fig. 2e), a defect that was also rescued by 
qrfinduction. Restoration of circadian rhythms of FRQ expression, FRQ 
phosphorylation profiles and frq mRNA oscillation by QA was also 
seen in the frq’°;frq.aq strain (Extended Data Fig. 2f, g). 
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Figure 2 | grfexpression is required for circadian rhythmicities. a, Northern 
blot analysis showing the levels of frq and qrf with/without 1 mM QA 

(0.1% glucose) in LL. Right panel: densitometric analyses of the results. 

Error bars show s.d. (1 = 3). b, Race tube analyses of ‘frq’°sfrq* (left; 

t (10 °M) = 21.6 + 0.3 hours) and frq".frq.aq (right; t (10° M) = 21.5 + 
0.4 hours) in 0.1% glucose with different concentrations of QA, in DD. t, 
period length. c, Luciferase reporter assay showing the normalized frq promoter 
activity after two days in DD, for the wild type (left) and for strain 
frq’°sfrq.aq (right), in the absence of QA. 
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qrf RNA oscillates in DD in the wild type in antiphase to frq (Fig. 3a)’, 
but WC-2 does not bind to the qrf promoter in DD (Extended Data 
Fig. 3a). Moreover, the qLRE mutation did not affect either frq or qrf 
levels in DD (Extended Data Fig. 3b), indicating that the WC complex 
does not regulate qrf transcription in DD. Crucially, a luciferase reporter 
(Pqrf-luc) driven by the qrf promoter showed that the qrf promoter 
activity is not rhythmic in a wild-type strain (Fig. 3b and Extended 
Data Fig. 3c, d). 

Several results indicate that frq and qrfare mutually inhibitory. First, 
ina frq’ mutant, only truncated FRQ protein is made, resulting in high 
frqlevels’* but decreased qrflevels (Extended Data Fig. 3e). Second, when 
the frq’”:frq **™' strain was exposed to light, induction of frq resulted 
in decreased qrf (Fig. 3c). Third, low frq mRNA levels in the wec?*° 
mutant led to elevated qrflevels in DD (Fig. 3d). These results indicate 
that frq inhibits qrf expression. 

To determine whether the rhythmic frq transcription can drive an anti- 
phasic rhythm of qrf without rhythmic qrf promoter activity, we created 
a luciferase reporter construct (Pmin-luc-Pfrq) in which the luciferase 
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Figure 3 | Mutual inhibition of frq and qrf transcription forms a double 
negative feedback loop that is required for clock function. a, Strand-specific 
RT-qPCR results showing the oscillations of frq (black) and qrf (red) in DD. 
Error bars show s.d. (n = 3). b, Luciferase reporter assay showing the 
normalized frq or qrf promoter activity after one day in DD in wild-type strains 
with the Pfrq-luc (top) or Pqrf-luc (bottom) construct. c, Strand-specific RT- 
qPCR results showing the levels of frq and qrfin the frq’°sfrqv**"™ strain 

at DD24 (filled bars) or 60 min in the dark after 2 min of light-pulse treatment 
(open bars). d, Levels of frq and arf in the wild-type (filled bars) and wec?X° 
(open bars) strains measured by strand-specific RT-qPCR in DD. e, Luciferase 
reporter assays showing the normalized luminescence levels in a wild-type 
strain that carries the Pfrq-luc (black) or Pmin-luc-Pfrq (red) construct. 

f, A model of the Neurospora oscillator. g, Mathematical simulation of 
relative frq levels (black traces) in DD without qrf (ky) = 0) (left), with qrf 
(kip = 0.5, ky7 = 0.35, kyo = 0.1) (middle; qrf shown as blue trace), and with 
qrf overexpression (ky, = 0.91) (right). For a description of the variables, 

see Methods. 


30 OCTOBER 2014 | VOL 514 | NATURE | 651 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


sense mRNA was driven by a constitutive promoter’ and antisense 
luciferase mRNA was driven by the frq promoter (Fig. 3e). Wild-type 
strains containing the Pfrq-luc or Pmin-luc (lacking antisense lucifer- 
ase RNA) construct were used as controls. Luminescence in the Pmin- 
luc strain was arrhythmic, but the Pmin-luc-Pfrq strain exhibited a robust 
circadian luminescence rhythm in antiphase to that of the Pfrq-luc rhythm 
(Fig. 3e and Extended Data Fig. 4a, b). These results indicate that the 
antiphasic rhythm of qrf expression is driven by rhythmic frq transcrip- 
tion independently of RNA sequence. Therefore, frq and qrf transcrip- 
tion forms a double negative feedback loop that results in antiphasic 
rhythms of frq and qrf (Fig. 3f). 

Mathematical modelling (Extended Data Fig. 4c)'* demonstrated 
that without qrf the Neurospora circadian oscillator can only generate 
alow-amplitude frq oscillation that damps out quickly (Fig. 3g). When 
the double negative feedback loops were introduced into the model, both 
frqand qrflevels oscillated robustly with antiphasic rhythms. When qrf 
was overexpressed, the frq oscillation was not sustained. These results 
suggest that the previously known circadian feedback loops, although 
not sufficient to sustain a persisting rhythm, are the source of the rhyth- 
micity that is amplified and sustained by the resonance of the mutually 
inhibitory and antiphasic expression of frq and qrf. 

How do qrf and frq inhibit each other? Light-induced frq transcrip- 
tion, frq and qrf levels and circadian rhythms were normal in RNA- 
mediated interference (RNAi) mutants (Extended Data Fig. 5a—-d)””. 
Convergent transcription is also known to induce DNA and histone 
H3K9 methylation in the frq region”®”’, but the deletion of genes required 
for DNA and histone H3K9 methylation also did not affect frq express- 
ion (Extended Data Fig. 5e, f). 

Weintroduced a frq construct (qrf) with the frq promoter deleted into 
the frq’” and frq’°sfrq’**"™ strains at the csr-1 locus (Extended Data 
Fig. 6a). This transgene can express normal levels of qrf without detect- 
able frq expression (Extended Data Fig. 6b). In the frq’°sfrqu P"sqrf 
strains, even though qrf expression was restored to normal levels, frq 
levels were not rescued (Extended Data Fig. 6c, d), indicating that grf 
regulates frq in cis. 

WC binding to the frq promoter initiates WC-dependent frq tran- 
scription, but the qLRE mutation did not affect WC binding at the frq 
promoter (Extended Data Fig. 6e, f). However, levels of frq pre-mRNA 
were significantly elevated in the qLRE mutants (Fig. 4a and Extended 
Data Fig. 6g). Moreover, qrf expression in the frq’’sfrq.aq strain did not 
affect frq RNA stability (Extended Data Fig. 6h). These results suggest 
that qrf regulates frq after transcriptional initiation. 

After transcriptional initiation, the RNA polymerase II (pol II) C- 
terminal domain (CTD) is phosphorylated at Ser 2 and Ser 5, with Ser 5 
and Ser 2 phosphorylation enriched near the 5’ and 3’ ends of tran- 
scribed regions, respectively”. Similar Ser 5 and Ser 2 phosphorylation 
profiles were seen at a Neurospora locus without antisense transcripts 
(Extended Data Fig. 7a, b). In contrast, both modifications of the CTD 
peaked at the same position in the middle of the transcribed frq region 
(Fig. 4b and Extended Data Fig. 8). Mutation of the qLRE, which de- 
creases qrf expression, resulted in decreases in both Ser 2 and Ser 5 phos- 
phorylation. Phosphorylation of pol II can trigger the methylation of 
histone H3 on Lys 36 (H3K36me3)**. H3K36me3 enrichment at the frq 
locus peakepd at the same position as CTD phosphorylation, and the 
qLRE mutation decreased H3K36me3 (Fig. 4b, bottom panel). In the 
frq’°;frq.aq strain minus QA, in which qrf expression was completely 
abolished, the distributions of CTD phosphorylation and H3K36me3 
on frq resembled the control locus lacking antisense transcription (Ex- 
tended Data Fig. 9a—c). These results suggest that pol II stalls in the mid- 
dle of the frq locus as a result of convergent transcription’. 

SET-2 methylates H3K36 in Neurospora” and is required for clock 
function’’. Even though frq and qrf levels were increased only modestly 
in the set-2“° single knockout mutants, their levels were markedly ele- 
vated in the frq’;set-2“° double mutant (Fig. 4c and Extended Data 
Fig. 10a), suggesting that H3K36me3 contributes to the suppression of 
frqand qrf transcription. However, the induction of qrf still resulted in 
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Figure 4 | grftranscription results in pol II stalling, premature termination 
of transcription, and chromatin modifications. a, Strand-specific RT-qPCR 
results showing the levels of intron-containing frq after a 2-min light pulse at 
DD?24 in the indicated strains. The numbers on the x axis represent the time 
points (minutes) in the dark after light induction. b, ChIP assays showing the 
relative enrichment of pol II Ser 5 (top histogram), Ser 2 (middle) and 
H3K36me3 (bottom) in the frq locus in LL. IgG was used as the mock 
control for IP. dLRE, distal LRE; pLRE, proximal LRE; qLRE, qrf LRE. 

c, Northern blot analyses showing the levels of frq and qrf in LL in indicated 
strains. d, Northern blot analyses showing the levels of frq and qrf with/without 
1mM QA (0.1% glucose) in LL in indicated strains. e, Northern blot analysis 
in LL. The addition of QA results in rrp44 silencing. frq-N term and frq-C term 
are specific for the 5’ or 3’ half of the frq transcripts, respectively (shown in 
b). Long exp., long exposure; kb, kilobases. f, Northern blot analysis in LL. 
The ratios between truncated and full-length frq transcripts from three 
independent experiments are shown. Error bars show s.d. (n = 3). 

**P < 0.01 (unpaired two-tailed f test with Gaussian distribution assumption). 


a decrease in frq in the set-2*°, frq'° double mutant strain (Fig. 4d and 
Extended Data Fig. 10b), indicating the existence of another mecha- 
nism that mediates the action of antisense transcription. 

We hypothesized that the stalling of pol II as a result of convergent 
transcription may abort transcription prematurely, which should result 
in truncated transcripts from their respective 5’ ends but not from the 
3' ends. Because these truncated transcripts are normally unstable, we 
first screened a panel of Neurospora mutants for the nuclease that de- 
grades such transcripts. The RNA exosome is involved in degrading full- 
length frq transcripts’®. Northern blot analysis using an RNA probe (frq-N 
term or qrf-N-term) specific for the 5’ end of frq or qrf, respectively, 
revealed that the silencing of rrp44, which encodes the exosome cata- 
lytic subunit, resulted in the appearance of a low-molecular-mass RNA 
smear in the wild-type strain (Fig. 4e and Extended Data Fig. 10c). In 
contrast, a probe specific for the 3’ half of frq failed to detect the RNA 
smear. The amount of the RNA smear was decreased markedly in the 
qLRE mutant, which has reduced levels of grf expression, (Fig. 4f) and 
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was not observed for a control gene without antisense transcripts (Ex- 
tended Data Fig. 10d). Taken together, these results suggest that both 
premature transcription termination and chromatin modifications con- 
tribute to the transcriptional inhibition of frq by qrf. 

We discovered here that the sense and antisense transcription of frq 
forms a double negative feedback loop that is interlocked with the core 
circadian feedback loops in Neurospora. The mutual transcription inter- 
ference of frq and qrf results in antiphasic oscillations of frq and qrf that 
resonate to achieve robust and sustained circadian gene expression. In 
the silkmoth and mouse liver, antisense per RNAs exist and were also 
found to oscillate in antiphase to sense RNAs”, suggesting that a 
similar mechanism may function in animal circadian systems. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Strains, plasmid constructs, and growth conditions. The wild-type strain used 
in this study was 87-3 (ras-1bd, a). The frq’” strain is a frq-null and qrf-null mutant’. 
The frq’ strain bears a frame-shift mutation in the frq open reading frame (ORF)'*. 
The mutants we- 1”, we-2®?, weeP®? (we- 1”, we-2*°), deIPX° (del-1”, del-2*°), 
qde-1*°, qde-2""” and qde-3*° were generated in previous studies'?*!. The dim- 
26° and dim-5*° strains were provided by Q. He*’. The set-2%° (FGSC no. 15505) 
strain was from Fungal Genetic Stock Center (FGSC). The set-2“°.frq’ double mutant 
was created in this study by crossing set-2° and frq’. The dsrrp44 strain was gen- 
erated by introduction ofa plasmid expressing QA-driven rrp44-specific RNA hair- 
pin into a wild-type strain”®. The fq’! and frqX/82"™ strains were created in 
this study (Extended Data Fig. 1b, e). The frqe('*"".dsrrp44 strain was created 
by crossing frq(18""™? and dsrrp44. The frq’°sfrq”, frq’sfrqn Pe", frq’°frq.aq 
strains were obtained by targeting plasmids pKAJ120, pKAJ1207 **"™' and pKAJ120. 
aq, respectively, into the his-3 locus of the frq’’ mutant as described previously'® 
(Extended Data Fig. 1c). The frq’®;frq? **"".qrf strain was generated by targeting 
the plasmid pCSR1.qrf into the csr-1 locus of the frq’®;frq’ "=" mutant as de- 
scribed previously** (Extended Data Fig. 6a). 

The pKAJ120 containing the entire wild-type frq gene, including its promoter 
and a his-3 targeting sequence, was used as the parental plasmid'*. The qLRE in 
pKAJ120 was mutated as described (Extended Data Fig. 1b) by site-directed muta- 
genesis to create pKAJ1207 "=". The fragment between BssHII and SapI sites of 
pKAJ120 was replaced with an inverted Neurospora qa-2 promoter to create pKAJ120. 
aq. The PCR fragment containing entire wild-type qrf and its promoter (primer, 
5'-TTCATTAAGGTGGGGCAGG-3’ and 5'-TTTCCACGCCGGCCCCAGTC-3’) 
was inserted into vector pCSR1 between NotI and PstI sites to generate pCSR1.qrf?. 

Growth conditions were as described previously’’. Liquid cultures were grown 
in minimal medium (1 X Vogel’s, 2% glucose). When QA was used, liquid cultures 
were grown in low-glucose medium (1 X Vogel’s, 0.1% glucose, 0.17% arginine) 
with indicated concentrations of QA. For rrp44 knockdown assay, Neurospora was 
cultured into mats in low-glucose medium with 10-7 M QA for two days. After- 
wards, Neurospora mats were cut into discs and cultured in flasks in the same medi- 
um with shaking. After two days the tissues were harvested. For the mRNA decay 
assay, the cultured conditions were as described previously”*. For rhythmic experi- 
ments, the Neurospora cultures were transferred from LL to DD at time 0 and were 
collected in DD at the indicated time (hours). For light induction, Neurospora cul- 
tures were grown in DD for 24h with shaking, and then treated with a 1,750 Ixlight 
pulse. Afterwards, Neurospora cultures were transferred back to DD and collected 
at the indicated time (minutes). For race tube assay, the medium contained 1 X 
Vogel’s, 0.1% glucose, 0.17% arginine, 50 ng ml‘ biotin and 1.5% agar with indi- 
cated concentrations of QA (for results shown in Extended Data Figs 2b, glucose 
was not present in the medium). Strains were inoculated and grown in constant light 
at 25 °C for 24 h before being transferred to DD at 25 °C. Calculations of period 
length and phase were performed as described**. 

Analysis of phase response to light. The assay was performed in race tubes con- 
taining acetate/casamino acid medium (1 X Vogel’s, 1.2% sodium acetate, 0.05% 
casamino acid hydrolysate, 1.5% agar)**. Race tubes were first grown in LL at 25 °C 
for 48 h before being transferred to DD. Cultures were then grown in DD at 25 °C 
for 25h, and different individual cultures were given a 2-min light pulse (1,750 lx) 
at different times (2-h intervals) to cover an entire circadian cycle. The amount of 
phase shift was determined by comparing light-treated cultures (six replicas for each 
time point) with those of the control cultures (kept in DD). The initial LL to DD 
transition was defined as circadian time (CT) 12. The phases of the cultures were 
calculated as the average phase for two consecutive days after the light treatment. 
Protein and RNA analysis. Tissue harvest, protein extraction and western blots 
were performed as described previously’. For protein separation, 40 pig of total 
protein was loaded in each lane of SDS-PAGE (7.5%). Total RNA was extracted 
with Trizol in accordance with the manufacturer’s protocol, and then further puri- 
fied with 2.5 M LiCl as described previously**. Northern blot analyses were per- 
formed as described previously using [*7P] UTP-labelled riboprobes'®. Riboprobes 
were transcribed in vitro from PCR products by T3 or T7 RNA polymerase (Ambion) 
with the manufacturer’s protocol. The primer sequences used for the template am- 
plification were frq-N term (5'-TAATACGACTCACTATAGGG (T7 promoter) 
GGCAGGGTTACGATTGGATT-3’ and 5'-GGGTAGTCGTGTACTTTGTCAG- 
3’), frq-C term (5'-TAATACGACTCACTATAGGG (T7 promoter) CCTTCGT 
TGGATATCCATCATG-3’ and 5’-GAATTCTTGCAGGGAAGCCGG-3’), qrf- 
N term (5’-AATTAACCCTCACTAAAGGG (T3 promoter) GAATTCTTGCA 
GGGAAGCCGG-3’ and 5’-CCTTCGTTGGATATCCATCATG-3’) and NCU01953 
N-term (5’-TAATACGACTCACTATAGGG (T7 promoter) GITGCCAAAGAG 
TIGGCCATTC-3’ and 5’-CTTGCACCACAAACTGTTGAAC-3’). Except for re- 
sults shown in Fig. 4e, f, riboprobes of frq-C term and qrf-N term were used for 
detecting frq and qrf, respectively. For Fig. 4e, f and Extended Data Fig. 10, the 
riboprobes were hydrolysed for 25min in 40 mM NazCO3, 60 mM NaHCOs, 


10 mM 2-mercaptoethanol; the reaction was then stopped with an equal volume 
of 0.2 M sodium acetate pH 6.0, 1% acetic acid, 10 mM dithiothreitol (DTT). For 
western blotting and northern blotting, densitometry analyses were performed 
with Image J. 

Quantitative PCR and RT-qPCR were performed as described previously”®. For 
strand-specific RT-qPCR, several modifications of the protocol were made. The 
primer sequences for strand-specific RT reactions were frq (5'-GCTAGCTTCAG 
CTAGGCATC (adaptor) CGTTGCCTCCAACTCACGTTTCTT-3’), qrf (5’-CC 
TCAGCTCGTACGAGTCGTAC (adaptor) GICATGGAGCCCTCTGGTCTTG 
GT-3'), frq pre-mRNA (5'-GCTAGCTTCAGCTAGGCATC (adaptor) TTGAA 
CGGTAGGGAGGAGGAGAG-3’) and f-tubulin (5'-CTCGTTGTCAATGCAG 
AAGGTC-3’). The RT reaction was performed by mixing the primers of specific 
strand and f-tubulin. $-tubulin was used for internal control. The primer sequences 
for the qPCR step of the RT-qPCR assay were frq (5’-AGCTTCAGCTAGGCA 
TCCGTT-3’ and 5'-GCAGTTTGGTTCCGACGTGATG-3’), qrf (5'-CAGCTC 
GTACGAGTCGTACGTC-3’ and 5’-ATCTTCCGATGTTGTCGAGCGT-3’), frq 
pre-mRNA (5'-AGCTTCAGCTAGGCATCTTGAACG-3’ and 5'-ACGGCATC 
TCATCCATTCTCACCA-3’) and f-tubulin (5'-ATAACTTCGTCTTCGGCCA 
G-3' and 5'-ACATCGAGAACCTGGTCAAC-3’). 

Strand-specific RNA sequencing. Conidia (~10° ml") were inoculated into 30 ml 
of Vogel’s minimal medium with 2% sucrose and incubated at 25 °C for 20 h with 
shaking. The mycelia were filtered through Whatman paper and washed once with 
ice-cold phosphate-buffered saline. About 200 mg was put into a 2.0-ml cryo vial 
with silicon beads and frozen in liquid nitrogen. Total RNA was isolated from fro- 
zen mycelia in 1.0 ml of TRIzol using a BioSpec beater. Total RNA (5 jig) was run 
out on a 1.0% agarose gel to check for degradation. The 3’ paired-end RNaseq li- 
brary was prepared from 10 jig of total RNA as described”. In brief, the poly(A)* 
fraction was isolated on oligo(dT); magnetic beads (Invitrogen) and then treated 
to chemical fractionation with ZnCl (Ambion) followed by another round of poly(A)* 
enrichment. First-strand cDNA was generated from an anchored oligo(dT). pri- 
mer followed by second-strand synthesis with DNA polymerase I. The addition of 
paired-end adaptors and size isolation with ampureXP beads (Ambion) was fol- 
lowed by PCR enrichment as described’’. The library has an average insert size of 
200 + 36 bp. The library was sequenced to a depth of 22 x 10° 36-bp paired-end 
reads on an Illumina GAIIx sequencer. 

Nascent nuclear transcripts isolation. Liquid cultures were grown in 11] of min- 
imal medium (1 X Vogel’s, 2% glucose). The dried frozen tissue (~4 g) was ground 
in liquid N>. 8 ml of cold buffer A (1 M Sorbitol, 7% Ficoll, 20% glycerol, 5 mM 
magnesium acetate, 3mM CaCl, 50 mM Tris-Cl pH7.5, 3mM DTT, 10 pg ml! 
leupeptin, 10 jig ml! pepstatin, 10 pg ml” ' phenylmethylsulphony fluoride (PMSF)) 
was mixed with tissue powder on ice for 5-10 min with stirring. The mixture was 
filtered through four layers of pre-wetted miracloth (buffer A) into small flask on 
ice. Buffer A was added to the filtered mixture to a volume of 8 ml. Pre-chilled buff- 
er B (10% glycerol, 5 mM magnesium acetate, 25 mM Tris-Cl pH 7.5, 10 pg ml! 
leupeptin, 10 jg ml pepstatin, 10 pg ml” ' phenylmethylsulphonyl fluoride) (16 ml) 
was slowly added with stirring. The mixture obtained was gently layered on 10 ml 
of pre-chilled buffer A/B (2.5:4) in a 50 ml screw-cap tube. The supernatant was 
collected after centrifuging at 3,000g for 7 min at 4 °C. Supernatant (33 ml) was 
gently layered on 5 ml of pre-chilled buffer D (1 M sucrose, 10% glycerol, 5 mM 
magnesium acetate, 25mM Tris-Cl pH7.5, 1mM DTT, 10pg ml! leupeptin, 
10 pg ml pepstatin, 10 1g ml’ PMSF). The pellet (purified nuclei) was collected 
after centrifugation at 9,400¢ for 15 min at 4 °C. The pellet was suspended in 1 ml of 
Trizol. Total RNA was extracted with Trizol in accordance with the manufacturer’s 
protocol. Contaminated DNA was removed with a TURBO DNA-free kit, follow- 
ing the manufacturer’s protocol. Treated RNA (50 ng) was used for strand-specific 
RT-qPCR. 

Chromatin immunoprecipitation (ChIP) assay. The ChIP assay was performed 
as described previously, with several modifications*. First, 500 1g of total cell lysate 
was used for each immunoprecipitation assay. The antibodies used in this study 
were WC-2 (ref. 12) (2 pl per assay), Pol II CTD (Abcam no. ab26721, 2 1g per 
assay), Pol IIS2P-CTD (Abcam no. ab5095, 2 jig per assay), Pol II S5P-CTD (Abcam 
no. ab5131, 2 1g per assay), histone H3 (Abcam no. ab1791, 2 1g per assay) and 
H3K36me3 (Abcam no. ab9050, 2 jig per assay). Each experiment was performed 
independently three times, and immuoprecipitation with IgG or wc” *° extract 
was used as the negative control. qPCR was used for measuring the immunopre- 
cipitated DNA. Primer sequences for qPCR were frq dLRE (5'-AGAGTTTGGC 
CGGACAACCAGTA-3’ and 5'-GCTTCGACCGAAAGTATCTTGAGCCT-3’), 
frqpLRE (5'-GTCGCAGAGGACCCTGAACTTT-3’ and 5’-TCCCACAGATGC 
ACAGGAATCG-3’) and qrf qLRE (5'-ATCGATCACTAGTCCCGGTTCGTT-3’ 
and 5’-TTGCTGATAATGCGCTGAGGGTCT-3’). For the experiments in Fig. 4b 
and Extended Data Figs 8 and 9 the primer sequences were 1 (5'-AGAGTTTG 
GCCGGACAACCAGTA-3’ and 5’-GCTTCGACCGAAAGTATCTTGAGCCT-3’), 
2 (5'-GGGTAGTCGTGTACTTTGTCAG-3' and 5'-ACCGGACTTTAGGTTG 
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TGTG-3’), 3 (5’-ACGGCCTTTCTCTGTTTACC-3’ and 5'-GCAGAGTTGGG 
TCGGATTTA-3’), 4 (5'-CCGTTCTAGCGTGCTTCTT-3’ and 5'-GGCACTA 
ATGAGGTTCGAGATT-3’), 5 (5’-TTACTTCATCTTCCGCACTGG-3’ and 5’- 
GGCAGGGTTACGATTGGATT-3’), 6 (5’-ACTGTTCACCGGAAAGATCAG- 
3’ and 5'-GGGCCATGTTGGTTCCTT-3’) 7 (5'-TTGCACCGATCTTTCAGG 
AG-3' and 5'-CTGCAGCACATGTTCAACTTC-3’) and 8 (5'-GGGCAGAGAG 
TGGCTATAGTA-3’' and 5'-CCGCGTCTTCTTCTCACATAG-3’). For the exper- 
iments in Extended Data Fig. 7 the primer sequences were 1 (5'-CAAGTGGG 
CCCTACAGTTATT-3’ and 5'-CCAATCCACTTCCCTTTCCA-3’), 2 (5'-CA 
CCGTGTTCGTGCTTACT-3’ and 5’-GGTATCCGTCTTGTCCCTTTG-3’), 3 
(5'-TCAAGATACGGAGCGAAGAATG-3’ and 5’-GATTGTAGCTGTTCCAC 
CTCTC-3’), 4 (5’-TGACCGACCCAGTTGATTTC-3’ and 5’-GTGGCACTTC 
TATCACCTTCTT-3’), 5 (5’-CGACCGTTAGGAGACCAAATAG-3’ and 5'-A 
ATGGCTTCCTTGTGGTTAGA-3’), 6 (5'-CTTAGTCGGATCAGTGGCTATT-3’ 
and 5’-TTCGTAGATGCCGTGAGATG-3’), 7 (5’-CGTGGACTTCTGCTACT 
CTTC-3' and 5’-CCTTTGCAGTCCTCCTCTTT-3’) and 8 (5’-GTCTGGGAG 
CTTCTGTTGAATA-3’ and 5'-TAAAGTGGTGAACGACCTCATC-3’). 
Luciferase reporter assay. The luciferase reporter assay was performed as described 
previously'®. The reporter construct in Fig. 2c (Pfrq-luc) was generated by inserting 
the BamHI-Not! fragment of Pfrq-luc-I (a gift from J. Dunlap) into corresponding 
sites of pBARKS1 (ref. 16). The reporter construct in Fig. 3b (Pqrf-luc) was gener- 
ated by inserting the PCR fragment containing the grf promoter and the luciferase 
gene into NotI-EcoRI sites of pBARKS1. For the PCR fragment, the qrf promoter 
was directly fused with the luciferase gene ORF. The primer sequences were qrf 
promoter (5’-ATCGATTTCATTAAGGTGGGG-3’ and 5'-GGTGGTGGGTGG 
TGGGTGGAG) and luciferase gene (5’-ATGGAGGACGCCAAGAACATCAA 
G-3’ and 5'-TCAGAGCTTGGACTTGCCGCC-3’). The plasmids Pfrq-luc and 
Pqrf-luc were transformed into wild-type or frq’”;frq.aq strains with Ignite selec- 
tion as described previously’. The plasmid Pmin-luc was generated by inserting 
the luciferase gene ORF into the XmalI site of pDE3dBH. The plasmid Pmin-luc- 
Pfrq was generated by inserting the inverted frq promoter (the same as Pfrq-luc-I) 
downstream of the luciferase gene ORF of Pmin-luc. The non-regulated basal pro- 
moter in pDE3dBH (upstream of the EcoRI site) was used as the minimal promoter 
to control the expression of the luciferase gene in Pmin-luc and Pmin-luc-Pfrq as 
described previously’. For Fig. 3e, the plasmids Pfrq-luc-I, Pmin-luc-Pfrq and Pmin- 
luc were transformed into the wild-type strain by targeting the his-3 locus. Under our 
experimental conditions, luciferase signals are highly variable during the first 1-2 
days in the LumiCycle and only become stabilized afterwards, which is probably due 
to an artefact caused by the transfer of the cultures from light to dark. Thus, the 
results presented were recorded after 1-2 days in DD. 
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Mathematical modelling. The mathematical simulation of relative frq and qrf RNA 
levels in DD was performed on the basis of a previously described model’**””, with 
several modifications. The equations used here are listed in Extended Data Fig. 4c. 

The differential equations (1-8) were solved numerically by the Gear method and 
XPPAUT*™. The parameters were as follows. The values of ky, ks, ke, kz, kg, ko, ku, 
kya, ky3, kya, ks) K, Ky and ko, were from ref. 18. The values of k, and k; were from 
ref. 40, 41. The value of k1o was from ref. 42. The values of other parameters were 
determined in this study. For Fig. 3g, ky = 0 (left panel); kyz = 0.35, kyy = 0.1 (mid- 
dle panel); and ky. = 0.1, ki7 = 0.35, ky = 0.91 (right panel). 
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Extended Data Figure 1 | Light-induced qrf expression represses frq control. e, Diagram describing the strategy used to obtain the knock-in strains 


transcription and regulates light resetting of the clock. a, Strand-specific by homologous recombination. f, Strand-specific RT-qPCR results showing 
RNA-Seq result of the frq locus, showing the overlapping frq and qrftranscripts. _ the expression levels of frq and qrf in the indicated knock-in strains in LL. 

b, Sequence alignment of known LRE elements. The qLRE and the mutated g, Western blot results showing the FRQ expression levels in the indicated 
regions in the qrf promoter are shown. c, Diagrams showing the chromosomal _ strains in LL. The densitometric analysis of western blot results from three 
modifications in the indicated loci in the frq’®;frq’”’ and frq’°;frq"*"" strains. independent experiments is shown at the right. Error bars show standard 

C, Clal; B, BglII; E, EcoRV. d, Strand-specific RT-qPCR results showing the deviations. **P < 0.01. h, Phase response curves of circadian conidiation 
expression levels of frq and qrf in indicated strains in LL. Error bars show rhythms of the indicated knock-in strains after 2 min of light pulse at different 


standard deviations (n = 3). **P < 0.01; n.s. indicates that the difference is not circadian time (CT) points. 


statistically significant. frq-RT and qrf -RT represent the non-RT reaction 


©2014 Macmillan Publishers Limited. All rights reserved 


= 
Oo 


< 
=a 
ig 
& 
iS 
6 
ry 
3 
20.5 
o 
2 
& 
oO 
4 


= 
° 


fe 7 cai ie 


in 
) 


LETTER 


Phase (Hrs) 


A (M 

ee frq’9;frq’™ frq’9;frq.aq 
0 11.6+0.3 13.0+0.1 
10°” 11.8+0.3 13.0+0.2 
106 11.4+0.5 12.8+0.1 
15% 11.3+0.3 12.8+0.6 
104 11.5+0.3 12.1+0.2 
10°3 11.9+0.2 11.3+0.3 


b : ; ri 1 _ 1 - i a QA (M) 


Gs 
x 
2 
2 
< 
g 
© 
£ 
oO 
3 
Days in DD 
dg 30 g 
£5 150 
73 
= 0 
gs 
B ¢-150 
6 10. i 
s 300 frq’";frq.aq - QA Days in DD 


= a M 
seuseane : 


Extended Data Figure 2 | grf expression is required for circadian 
rhythmicities. a, Strand-specific RT-qPCR results showing the expression 
levels of qrfin in the indicated knock-in strains at DD24. b, Race tube analyses of 
the frq’°sfrq” and frq’®;frq.aq strains in medium containing 0% glucose and 
0.17% arginine with the indicated concentrations of QA in DD. The lack of 
glucose in medium is known to allow more efficient expression from the qa-2 
promoter. The black lines on race tubes indicate the daily growth fronts. c, The 
unnormalized luciferase activity of the experiments in Fig. 2c. d, The 


[Luciferase | 


frq’°;frq.aq 


Days in DD 
AS a 


yp 
Ss e e SS Ss Ss S 


+ OMQA + 10“¢MQA 


3 
PSA 
0 

Pe & SP oO 


normalized result of Fig. 2c on a X10 scale, showing that fluctuation of the 
luciferase activity in the frq"°;frq.aq strain is random and not rhythmic. e, The 
phases of the first conidiation band in DD of the race tube results shown in 
Fig. 2b. f, Western blot analysis showing the FRQ expression profiles in the 
frq’”;frq.aq strain in the presence and absence of QA in DD at the indicated 
time points. g, Northern blot analysis showing frq expression profiles in the 
frq’®;fcq.aq strain in DD at the indicated time points. The densitometric 
analysis is shown below. 
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Extended Data Figure 3 | Regulation of grfexpression. a, WC complex does 
not bind to the grf promoter in DD. WC-2 ChIP assays showing the relative 
levels of WC binding at the frq (dLRE) and qrf (qLRE) promoters at the 
indicated time points in DD. b, Strand-specific RT-qPCR results showing the 
levels of frq and qrf at DD24 in the indicated strains. c, The unnormalized 
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luciferase activity of the experiments in Fig. 3b. d, The normalized luciferase 
activity of the Pqrf-luc construct in Fig. 3b shown on a X10 scale. 

e, Densitometric analyses of three independent northern blot results, indicating 
that levels of qrf transcripts are decreased in the frq’ strains as a result of 
increased frq expression. 
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Extended Data Figure 4 | Rhythmic transcription of frq drives rhythmic qrf 
expression and mathematical modelling of the Neurospora circadian 
oscillator. a, The unnormalized luciferase activity of the experiments in Fig. 3e. 
b, The unnormalized luciferase activity of the wild-type strain carrying the 
Pmin-luc construct. Results of two independent transformants are shown. 

c, Mathematical modelling of the Neurospora circadian oscillator with the 


double negative feedback loop. The differential equations used in the model are 
shown. The model is identical to a previously developed model” with the 
exception of equation (1), which in this case includes the inhibition of frq 
transcription by qrf, and equation (8), which includes the inhibition of qrf 
transcription by frq. The rate constants used in the simulations are listed below. 
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Extended Data Figure 5 | Neither RNAi nor DNA methylation pathways indicated strains in LL. d, A table showing the period lengths of the conidiation 
have a significant role in the clock. a, b, Strand-specific RT-qPCR results rhythms in the wild type and in different RNAi mutants. e, Strand-specific 


showing the induction of frq after a light pulse are similar in the indicated RT-qPCR results showing that the induction of frq after a light pulse is similar 
strains at DD24. b, Strand-specific RT-qPCR results showing the induction of _ in the indicated strains at DD24. f, Strand-specific RT-qPCR results showing 
frq after a light pulse is similar in the indicated strains at DD24. c, Strand- that the induction of frq after a light pulse is similar in the indicated strains at 


specific RT-qPCR results showing similar expression levels of frq and qrfinthe | DD24. Error bars show standard deviations (n = 3). 
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Extended Data Figure 6 | Characterization of the qrf action on frq double mutant (wec?*°) was used as a negative control for ChIP. n.s. indicates a 


expression. a, Diagram showing the chromosomal modifications in the 

fig? "orf strains that allow the expression of qrf in trans. The frg?™”"™" 
construct is at the his-3 locus, and qrfis expressed only from the csr-1 locus. The 
red dashed line indicates that the frq promoter region is deleted in the qrf 
construct to abolish frq expression. b, Strand-specific RT-qPCR results 
showing that only qrf is expressed from the qrf construct in the frq’“sqrf strain. 
c, qrf expression does not repress frq transcription in trans. Strand-specific RT- 
qPCR results showing the levels of frq and qrf transcripts in the indicated strains 
in LL. Error bars show standard deviations. *P < 0.05 (n = 3). d, Northern 
blot results showing that expression of qrf in trans in the frq'°.frqv""sqrf 
strains does not repress frq expression. Densitometric analysis of the northern 
blot results is shown at the right. e, WC-2 ChIP assays showing the relative 
WC binding levels at the frq promoter in LL in the indicated strains. The we 


lack of statistical significance (n = 3). f, WC-2 ChIP assays showing the relative 
WC-2 binding levels at the frq promoter in LL in the indicated strains. The 
we double mutant (wec?*®) was used as a negative control for the ChIP assays. 
n.s. indicates a lack of statistical significance (n = 3). g, Strand-specific RT- 
qPCR results showing that the mutation of the qLRE element in the qrf 
promoter results in significant increases in light-induced frq pre-mRNA 
expression. Nascent nuclear RNA was used. h, Northern blot results showing 
that the stability of frq mRNA is not affected by the transcription of qrf. The 
frq’*;frq.aq strain that can induce qrf expression in the presence of QA was 
used. Thiolutin, a transcription inhibitor™, was added in the culture to block frq 
transcription so that frq mRNA stability could be determined. Cultures were 
harvested at the indicated time points after the addition of thiolutin. 
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Extended Data Figure 7 | Modified Pol II CTD distribution profiles of a b, ChIP assays showing the relative enrichment of Pol II Ser 5 phosphorylation, 
locus lacking antisense transcripts. a, Top panel: strand-specific RNA-Seq Pol II Ser2 phosphorylation, and H3K36me3 at the NCU01953 locus in the 
results of the NCU01953 locus. Bottom panel: the positions of primers usedin _ wild-type strain. The scale on the y axis is the enrichment percentage of 
ChIP assays and the position of the riboprobe used in northern blot analysis. | immunoprecipitation (IP) over input. IgG was used as the mock control for IP. 
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Extended Data Figure 8 | Modified Pol II CTD distribution profiles at the _ for IP. c, ChIP assays showing the relative enrichment of Pol II Ser5 

frq locus. a, Diagram showing the positions of primers at the frqlocus used _ phosphorylation and Pol II Ser2 phosphorylation (ChIP data in Fig. 4b were 
for ChIP assays and the positions of riboprobes used for northern blot analyses. normalized by Pol II CTD ChIP results in b), and H3K36me3 (ChIP data in 
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the frq locus. The scale on the y axis is the enrichment percentage of **P < 0.01, ***P < 0.001 (n = 3). 

immunoprecipitation (IP) over input. IgG was used as the mock control 
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Extended Data Figure 10 | Mechanism of qrf action on frq expression. 

a, Densitometric analyses of three independent experiments shown in Fig. 4c. 
b, Densitometric analyses of independent experiments shown in Fig. 4d. 

c, Northern blot analysis using a N-term probe (marked in Extended Data 


Fig. 8a) specific for the 5’ half of qrf transcripts in the WT and WT;dsrrp44 
strains in LL. d, Northern blot analysis using a N-term probe (marked in 
Extended Data Fig. 7a) specific for the 5’ half of NCU01953 transcripts in the 
WT; dsrrp44 strain in LL. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature13840 


Corrigendum: Immune clearance 
of highly pathogenic SIV infection 


Scott G. Hansen, Michael Piatak Jr, Abigail B. Ventura, 

Colette M. Hughes, Roxanne M. Gilbride, Julia C. Ford, 

Kelli Oswald, Rebecca Shoemaker, Yuan Li, Matthew S. Lewis, 
Awbrey N. Gilliam, Guangwu Xu, Nathan Whizin, 

Benjamin J. Burwitz, Shannon L. Planer, John M. Turner, 
Alfred W. Legasse, Michael K. Axthelm, Jay A. Nelson, 

Klaus Frith, Jonah B. Sacha, Jacob D. Estes, Brandon F. Keele, 
Paul T. Edlefsen, Jeffrey D. Lifson & Louis J. Picker 


Nature 502, 100-104 (2013); doi:10.1038/nature12519 


The Acknowledgements section of this Letter should have included the 
following sentence: “We acknowledge the contribution of M. A. Jarvis 
to the design, construction and initial in vitro characterization of all the 
strain 68.1-derived RhCMV vectors used in this study, including both 
previously published RhCMV/SIV vectors and a previously unpublished 
vector expressing an Mycobacterium tuberculosis Ag85B-ESAT6 fusion 
protein, used as a negative control in this study.” S.L.P. and J.M.T. did 
not reply when asked to approve the wording of this Corrigendum. The 
HTML version of the paper has been corrected. 
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DEVELOPMENT HUBS 


Rust no more 


Ohio’s investments into research are ushering the state from 
amanufacturing past into a knowledge-economy future. 


BY CAMERON WALKER 


centred around industrial manufactur- 
ing and agriculture, thanks in large part 
to alocation on the western side of the histori- 
cally factory-rich region of the United States 


a generations, Ohios economy has been 


and the eastern reaches of the vast fields of the 
corn belt. But in the past decade, this Midwest- 
ern state has begun to transform itself into a 
manufacturing hub ofa different kind. A US$2- 
billion infusion of state funding to develop 
the state’s technology economy, along with 
a cluster of research institutions and a steady 


proliferation of start-ups, have resulted in an 
outpouring of research and tangible goods in 
an array of sectors from pharmaceuticals and 
agriculture to medical testing and devices. 
Much of the state’s research, development 
and technology transfer take place in its larg- 
est cities — especially Cleveland, Columbus 
(the capital) and Cincinnati. That is largely 
because they contain the most institutions, 
including Ohio State University in Columbus 
and the Cleveland Clinic and Case Western 
Reserve University in Cleveland. Research- 
ers there and elsewhere in the state attracted 
$685 million in biomedical funding from the 
US National Institutes of Health (NIH) last 
year, putting Ohio among the ten states that 
receive the most such funding annually. In 
2012, a combination of NIH funding, further 
federal grants, industry funding and private 
money gave Ohios researchers a grand total of 
$1.3 billion to spend on bioscience, according 
to Battelle, a non-profit research organization 
in Columbus, and the Biotechnology Industry 
Organization in Washington DC. 


UP, UP, UP 
All this investment has been producing jobs 
in both academic and industrial bioscience 
research, even during the recent recession. 
Between 2007 and 2012, biosciences employ- 
ment in the state’s private sector rose by 2.2% 
to a total of 48,294 jobs, while the national rate 
remained essentially flat, says Battelle's Ryan 
Helwig. In several sectors, including pharma- 
ceuticals; agricultural feedstock and chemicals; 
and medical devices, which alone involves 
4,523 jobs and just in the Cleveland area, 
employment grew by double digits during 
that period, outpacing national hiring in these 
sectors. Although these jobs include sales and 
manufacturing positions, companies such as 
Neuros Medical near Cleveland are also seek- 
ing research scientists. In 2015, the company 
plans to recruit biomedical engineers, clinical 
managers and field clinical specialists, among 
others, as it continues pivotal clinical trials on 
an implantable nerve-blocking device to treat 
pain for people who have had amputations. 
Much of the growth in medical devices 
and other private-sector ventures stems from 
research initiated at the state’s universities and 
institutions and from researchers who have fol- 
lowed an entrepreneurial path. One of these 
is cardiologist Marc Penn, founder and chief 
medical officer of three biomedical companies, 
including Cleveland HeartLab, which devel- 
ops and performs diagnostic tests to assess > 
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> cardiovascular-disease risk. Although many 
institutions are going through a steep learning 
curve when it comes to technology transfer, 
he says, “the opportunities outweigh the risks” 
for researchers interested in working in Ohio. 

Perhaps the state's biggest system for convert- 
ing its bioscience research into goods, services 
and businesses is a $2.1-billion taxpayer- 
supported initiative called Third Frontier, 
based in Columbus. Launched in 2002, it aims 
to support the development of technology- 
based products, companies, industries and 
jobs; one programme designed to commer- 
cialize biomedical research received more than 
$151 million in state funds in 2013. 

Between 2009 and 2012, Third Frontier 
invested in 1,080 companies, including 205 in 
agricultural bioscience and medical technolo- 
gies. Its investments were directly responsible 
for creating or retaining 7,780 jobs during this 
period. 

Two of the most recent recipients are in 
turn funding bioscience research in Ohio and 
nationally. In June, Third Frontier awarded 
the Harrington Discovery Institute at Univer- 
sity Hospitals in Cleveland — the non-profit 
arm of The Harrington Project for Discovery 
& Development — $25 million to advance its 
efforts to commercialize drug discoveries. 
As well as giving physician-scientists in the 
United States a base amount of $100,000 over 
two years, the institute connects them with 
entrepreneurial and pharmaceutical develop- 
ment teams to help to shepherd their findings 
to market. Some 31 projects are en route to 
becoming drugs, with a biomedical start-up 
around each. The Harrington Project plans 
to support at least 50 


such projects at any “There are a lot 
given time, says Baiju Of opportunities 
Shah, chief execu- withthe 

tive of BioMotiv, The companies that 
Harrington Project's we’re developing 
for-profit arm.Shah to attract 

says that BioMotiv scientists and 


may be seeking pro- 
ject managers, partic- 
ularly those who have experience in managing 
pharmaceutical development, to lead those 
fledgling businesses. 

Third Frontier has also supported Ohio's 
growing neurotechnology field. In June, it 
awarded $21 million over four years to the 
Neurotechnology Innovations Translator, a 
for-profit partnership between Ohio State and 
other research institutions, health-care com- 
panies and venture-capital firms among oth- 
ers that aims to create and grow neuroscience 
technology businesses and will begin funding 
projects in early 2015. 

“There are a lot of opportunities with the 
companies that we’re developing to attract 
scientists and engineers,” says Ali Rezai, a neu- 
rosurgeon at Ohio State and principal investi- 
gator for the translator. He estimates that the 
companies that emerge from the project will 


engineers.” 


‘= 


At the Global Center for Health Innovation, researchers can showcase their innovations and technologies. 


generate at least 160 jobs, including positions 
for researchers and engineers. 

Ohio State is also expanding its faculty as 
part of an initiative that seeks to produce col- 
laborative research, teaching and outreach in 
theme areas of food production and security, 
energy and the environment, and health. In the 
next 10 years, it intends to increase the number 
of tenured and tenure-track faculty members 
by 8-10%, or about 300, with a particular focus 
on bioscience disciplines related to the initia- 
tive’s themes. 

At present, the university has set aside 
$60 million for tenure-track positions in data 
analytics, emerging and re-emerging diseases, 
sustainable materials and food to improve 
health, with the first recruits expected on cam- 
pus by autumn 2015. 

In June, an effort was launched to bring 
Ohio’s research institutions into a network 
that will help researchers to develop and run 
clinical trials in areas such as neuro-oncology, 
neurosciences, paediatrics and infectious 
diseases. Organizers of the Cleveland-based 
Ohio Clinical Trials Collaborative, a multi- 
institution partnership that has received two 
years of state funding, intend for it to become 
self-sustaining by 2016. 

It has already made many national and inter- 
national industry contacts and has close to 30 
trial opportunities in development, says John 
Peterson, the project's global business-devel- 
opment director. As more industry-sponsored 
trials come to the state, the collaborative and 
participating hospitals will need more research 
and support staff, he says. 


ON DISPLAY 

Several large multinational and national 
companies have headquarters or outposts 
in Ohio, from Proctor & Gamble to Battelle. 
Last autumn, Cleveland opened the 2-hectare 
Global Center for Health Innovation adjacent 
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to the new Cleveland Convention Center. The 
centre has four floors of permanent show- 
rooms that companies and institutions — 
including Philips and University Hospitals 
— are occupying. “That’s another symbolic 
commitment of the state and the region” to 
bioscience, says Aram Nerpouni, president 
and chief executive of BioEnterprise, a Cleve- 
land-based business formation, recruitment 
and acceleration programme. 

From 2002 to this year, Cleveland's biomedi- 
cal cluster grew from 300 companies to more 
than 700. Around the state, business-support 
organizations such as the Youngstown Busi- 
ness Incubator and TechColumbus — as well as 
bioscience-specific efforts, including the Global 
Cardiovascular Innovation Center in Cleveland 
— help entrepreneurs to launch start-ups. 

Before Goutham Narla, a medical geneticist 
at Case Westerns School of Medicine, came to 
Cleveland from Mount Sinai Hospital in New 
York City to explore a potential job, the state of 
Ohio had never been on his radar. But now he 
and his wife, cancer biologist Analisa DiFeo, 
love it. Along with its world-class researchers, 
the region has an ideal combination of ambi- 
tion and amiability. It is not so laid-back that 
nothing gets done, but also “not so cut-throat 
that you're wondering if you can trust anyone, 
even your colleagues’, Narla says. 

The five graduate students who came with 
Narla from Mount Sinai have enjoyed the 
regions lower cost of living and easier entrée 
to top-shelf research institutions and research- 
ers. Early-career scientists often find it difficult 
to tap into the high-level networks based on 
the east and west coasts, Narla says, but that 
is not the case in Ohio. “That's a neat thing 
here — you have a lot of access, because people 
are more welcoming.” m 


Cameron Walker is a science writer in Santa 
Barbara, California. 
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DUMPSTER DIVING 


BY ALVARO ZINOS-AMARO 


aniel Weatherby, 45, wealthy and 
D fast dying from complications of a 

malignant neoplasm in the pharynx 
— throat cancer — was cut off from any 
living relatives, and unattached. But the 
reality ofhis aloneness and imminent death 
didn't particularly bother him. He was going 
to be cryogenically preserved. 

“Would you like another pillow?” the 
nurse asked, and waited for him to gesture 
with his head. 

Daniel had lost the use of his throat weeks 
ago. A man who prized silence above all else, 
he now wondered whether in a way the disease 
— ifit weren't for the dying part — might not 
be a blessing in disguise, as he was no longer 
expected to talk. But lately he spent a lot of time 
drifting in and out of consciousness; his hands 
often turned blue; and his mouth was continu- 
ally dry, his lips cracked. All this he could do 
without. He nodded and the nurse propped 
him up on a second foam cushion. She used 
glycerin swabs and lip balm to soothe his pain. 

“Anything else I can do for you?” 

Daniel ignored her. He had realized soon 
after being admitted to the hospital that one 
way of dismissing a caregiver was simply to 
stop acknowledging them. 

A few moments later, when he peed him- 
self and died, Daniel was pleased that at least 
he did so quietly. 


He was dreaming that he was back on the 
sprawling Wisconsin farm of his youth. 
Then his eyes adjusted to the brightness of a 
small grey room, without windows or doors. 

The first thing he felt was a complete 
absence of pain. He cleared his throat, which 
seemed to be working fine. He had been 
healed. Tears welled in his eyes. 

He blinked. Two tall, identically dressed 
figures appeared by his bedside. 

“What is this place?” Daniel asked. 

The man and woman, eyes closed, did 
not reply. 

Daniel tried to get out of bed, but was so 
weak he almost passed out. 

“What year is this?” 

The woman inclined her head in Daniel's 
direction, and then the man followed suit. 
Their eyes remained closed. 

“Hey, you two, I’m right here! What’s 
going on?” 

The man smiled first, then the woman. 

The year, by your calendar, is 2189, an 
inhumanly clear voice said in Daniel’s head. 


The silent treatment. 


“Please talk to me the old-fashioned way.” 

The old ways are gone, the voice said. They 
were problematic and had to go. 

Daniel’s head began to throb. This was 
more than dizziness. 

We are very pleased about the condition 
of your mind, the voice said. Spacious, 
unadulterated. 


Daniel gritted his teeth. “Stop doing what- 
ever youre doing. It’s hurting me” 

The examination is now over, the voice said. 

His head returned to normal. Or did it? 
Daniel blinked. Now there were eight indi- 
viduals, four men and four women. “Look, 
I really appreciate what you've done for me,” 
he mumbled. “Bringing me back to life and 
all. But I need some fresh air, assuming it still 
exists in 2189. Can you show me outside?” 

This time he managed to stand up, naked 
and all. 

We've done more than that, Daniel. We’ve 
made you immortal. 

A vision flashed through Daniel's mind. 
Idyllic parks, bountiful sunshine. Pristine 
mountains, lakes, oceans, forests, rivers. 
Extinct species restored. A thriving plan- 
etary ecosystem. Everyone smiling and in 
perfect health. No disease, no crime. No 
poverty in the world, no poverty of the mind. 
Everyone happy, all the time. Everyone 
generous and kind — 

And ofakind. 

“You're all... the same?” 

One voice, said the 


> NATURE.COM 


voice. 
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his captors on his way to the farthest wall, 
which he planned to bang on with his bare 
arms and fists until someone let him out. 
Instead, he passed right through the figures 
and tumbled to the floor. 

We have no wish to assimilate your con- 
sciousness into ours, said the voice. We prize 
that about you, that you have not heard our 
voice before. For we are always in need of 
neuronal space, and a single unconnected 
mind like yours would be of great value to 
our network. 

Daniel struggled into an upright position. 
“What the hell do you want with my mind?” 

The eight figures opened their eyes, 
revealing no pupils, just whites. Daniel 
preferred them the way theyd been before. 
“There were no buildings in what you 
showed me,’ he said. “We're underground, 
arent we?” 

A perfect world, he realized, in which all 
the machinery had been hidden away. And 
not just the machinery — but everything 
that was unwanted. 

The refuse. 

In order to maintain our calm, the voice 
said, we must continuously shed that part of 
ourselves that would lead us back to the old 
ways. We must shed our jealously, greed, vio- 
lence and lust. Our pain and suffering. 

The figures disappeared, until only one 
remained. 

We sift through the biological scraps that 
survive from the past and occasionally get 
lucky. As with you. Perfectly preserved. Func- 
tional. Your mind will allow us to store the 
collected anguish of two billion people —a 
full tenth of the planet’s population — for 
years. You will perform a great service for 
humanity, Daniel Weatherby. Because of 
this, and because you will perform it forever, 
we wished to exercise a minimum of compas- 
sion and inform you of your fate. Thank you 
for your contribution. 

“T don’t want to —” 

Daniel’s voice choked off. Everyone was 
gone. 

He was alone in the room. 

But not alone. Endless horrors unspooled 
inside his mind. 

Tears fell from Daniel's eyes as he realized 
he would never again speak. m 


Alvaro Zinos-Amaro is co-author, with 
Robert Silverberg, of When the Blue Shift 
Comes (March 2014). His short fiction has 
previously appeared in Nature, Analog and 
Galaxy’s Edge. 
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